(54) Title: METHOD FOR GENERATING DIVERSITY

(57) Abstract: The invention relates to a method for preparing an antibody-producing cell
line capable of directed constitutive hypermutation of a specific nucleic acid region,
comprising the steps of: a) screening a clonal cell population for V gene diversity;
b) isolating one or more cells which display V gene diversity and comparing the rate of
accumulation of mutations in the V genes and other genes of the selected cells; and
c) selecting a cell in which the rate of V gene mutation exceeds that of other gene mutation.
Method for Generating Diversity



PCT/GB02/02688 



The present invention relates to a method for generating diversity in a gene or gene
product by exploiting the natural somatic hypermutation capability of antibody-producing
cells, as well as to cell lines capable of generating diversity in defined gene products.

Many in vitro approaches to the generation of diversity in gene products rely on the
generation of a very large number of mutants which are then selected using powerful
selection technologies. For example, phage display technology has been highly successful
in providing a vehicle that allows for the selection of a displayed protein (Smith, 1985;
Bass et al., 1990; McCafferty et al., 1990; for review see Clackson and Wells, 1994).
Similarly, specific peptide ligands have been selected for binding to receptors by affinity
selection using large libraries of peptides linked to the C terminus of the lac repressor
LacI (Cull et al., 1992). When expressed in E. coli the repressor protein physically links
the ligand to the encoding plasmid by binding to a lac operator sequence on the plasmid.
Moreover, an entirely in vitro polysome display system has also been reported (Mattheakis
et al., 1994) in which nascent peptides are physically attached via the ribosome to the RNA
which encodes them.

In vivo the primary repertoire of antibody specificities is created by a process of DNA
rearrangement involving the joining of immunoglobulin V, D and J gene segments.
Following antigen encounter in mouse and man, the rearranged V genes in those B cells
that have been triggered by the antigen are subjected to a second wave of diversification,
this time by somatic hypermutation. This hypermutation generates the secondary
repertoire from which good binding specificities can be selected, thereby allowing affinity
maturation of the humoral immune response.

Artificial selection systems to date rely heavily on initial mutation and selection, similar
in concept to the initial phase of V-D-J rearrangement which occurs in natural antibody
production, in that it results in the generation of a "fixed" repertoire of gene product
mutants from which gene products having the desired activity may be selected.

In vitro RNA selection and evolution (Ellington and Szostak, 1990), sometimes referred to
as SELEX (systematic evolution of ligands by exponential enrichment) (Tuerk and Gold,
1990) allows for selection for both binding and chemical activity, but only for nucleic acids.
When selection is for binding, a pool of nucleic acids is incubated with immobilised
substrate. Non-binders are washed away, then the binders are released and amplified, and the
whole process is repeated in iterative steps to enrich for better binding sequences. This
method can also be adapted to allow isolation of catalytic RNA and DNA (Green and
Szostak, 1992; for reviews see Chapman and Szostak, 1994; Joyce, 1994; Gold et al., 1995;
Moore, 1995). SELEX, thus, permits cyclical steps of improvement of the desired activity,
but is limited in its scope to the preparation of nucleic acids.

Unlike in the natural immune system, however, artificial selection systems are poorly
suited to any facile form of "affinity maturation", or cyclical steps of repertoire generation
and development. One of the reasons for this is that it is difficult to target mutations to
regions of the molecule where they are required, so subsequent cycles of mutation and
selection do not lead to the isolation of molecules with improved activity with sufficient
efficiency.

Much of what is known about the somatic hypermutation process which occurs during
affinity maturation in natural antibody production has been derived from an analysis of
the mutations that have occurred during hypermutation in vivo (for reviews see Neuberger
and Milstein, 1995; Weill and Reynaud, 1996; Parham, 1998). Most of these mutations
are single nucleotide substitutions which are introduced in a stepwise manner. They are
scattered over the rearranged V domain, though with characteristic hotspots, and the
substitutions exhibit a bias for base transitions. The mutations largely accumulate during
B cell expansion in germinal centres (rather than during other stages of B cell
differentiation and proliferation) with the rate of incorporation of nucleotide substitutions
into the V gene during the hypermutation phase estimated at between 10^-4 and 10^-3 bp^-1
generation^-1 (McKean et al., 1984; Berek & Milstein, 1988).


The possibility that lymphoid cell lines could provide a tractable system for investigating
hypermutation was considered many years ago (Coffino and Scharff, 1971; Adetugbo et
al., 1977; Brüggemann et al., 1982). Clearly, it is important that the rate of V gene
mutation in the cell line under study is sufficiently high not only to provide a workable
assay but also to be confident that mutations are truly generated by the localised antibody
hypermutation mechanism rather than reflecting a generally increased mutation rate as is
characteristically associated with many tumours. Extensive studies on mutation have been
performed monitoring the reversion of stop codons in VH in mouse pre-B and
plasmacytoma cell lines (Wabl et al., 1985; Chui et al., 1995; Zhu et al., 1995; reviewed
by Green et al., 1998). The alternative strategy of direct sequencing of the expressed V
gene has indicated that VH gene diversification in several follicular, Burkitt and Hodgkin
lymphomas can continue following the initial transformation event (Bahler and Levy,
1992; Jain et al., 1994; Chapman et al., 1995 and 1996; Braeuninger et al., 1997). Direct
sequencing has also revealed a low prevalence of mutations in a cloned follicular
lymphoma line, arguing that VH diversification can continue in vitro (Wu et al., 1995).
None of the reports of constitutive mutation in cell lines cited above provides evidence
that the mutations seen are the result of directed hypermutation, as observed in natural
antibody diversification, which is concentrated in the V genes, as opposed to a general
susceptibility to mutation as described in many tumour cell lines from different lineages.

Recently, hypermutation has been induced in a cell line by Denepoux et al. (1997), by
culturing cells in the presence of anti-immunoglobulin antibody and activated T-cells.
However, the hypermutation observed was stated to be induced, not constitutive.

Summary of the Invention 

In a first aspect of the invention there is provided a method for preparing a lymphoid cell
line capable of directed constitutive hypermutation of a target nucleic acid region,
comprising screening a cell population for ongoing target sequence diversification, and
selecting a cell in which the rate of target nucleic acid mutation exceeds that of other
nucleic acid mutation by a factor of 100 or more.


As used herein, "directed constitutive hypermutation" refers to the ability, observed for
the first time in experiments reported herein, of certain cell lines to cause alteration of the
nucleic acid sequence of one or more specific sections of endogenous or transgene DNA
in a constitutive manner, that is without the requirement for external stimulation. In cells
capable of directed constitutive hypermutation, sequences outside of the specific sections
of endogenous or transgene DNA are not subjected to mutation rates above background
mutation rates.

A "target nucleic acid region" is a nucleic acid sequence or region in the cell according to
the invention which is subjected to directed constitutive hypermutation. The target
nucleic acid may comprise one or more transcription units encoding gene products, which
may be homologous or heterologous to the cell. Exemplary target nucleic acid regions are
immunoglobulin V genes as found in immunoglobulin-producing cells. These genes are
under the influence of hypermutation-recruiting elements, as described further below,
which direct the hypermutation to the locus in question. Other target nucleic acid
sequences may be constructed, for example by replacing V gene transcription units in loci
which contain hypermutation-recruiting elements with another desired transcription unit,
or by constructing artificial genes comprising hypermutation-recruiting elements.

"Hypermutation" refers to the mutation of a nucleic acid in a cell at a rate above
background. Preferably, hypermutation refers to a rate of mutation of between 10^-5 and
10^-3 bp^-1 generation^-1. This is greatly in excess of background mutation rates, which are
of the order of 10^-9 to 10^-10 mutations bp^-1 generation^-1 (Drake et al., 1988) and of
spontaneous mutations observed in PCR. 30 cycles of amplification with Pfu polymerase
would produce <0.05 x 10^-3 mutations bp^-1 in the product, which in the present case would
account for less than 1 in 100 of the observed mutations (Lundberg et al., 1991).
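
By way of illustration only, the comparison between hypermutation, background mutation and PCR error reduces to simple arithmetic. The following sketch assumes a hypermutation rate of 10^-5 to 10^-3 and a background rate of 10^-9 to 10^-10 bp^-1 generation^-1; the function names and the observed prevalence of 5 x 10^-3 mutations bp^-1 are hypothetical values chosen for the example and are not data from the present application.

```python
# Illustrative arithmetic only. All numeric inputs are assumptions chosen for
# the example; none of them is a measurement from the application.

def fold_over_background(hyper_rate: float, background_rate: float) -> float:
    """Return how many times a hypermutation rate exceeds a background rate.

    Both rates are expressed per base pair per generation.
    """
    if hyper_rate < 0 or background_rate <= 0:
        raise ValueError("hyper_rate must be >= 0 and background_rate > 0")
    return hyper_rate / background_rate


def pcr_error_share(pcr_error_per_bp: float, observed_per_bp: float) -> float:
    """Return the fraction of an observed mutation prevalence that PCR error could explain."""
    if observed_per_bp <= 0:
        raise ValueError("observed prevalence must be positive")
    return pcr_error_per_bp / observed_per_bp


if __name__ == "__main__":
    # Slowest assumed hypermutation against the fastest assumed background.
    lowest = fold_over_background(1e-5, 1e-9)
    # Fastest assumed hypermutation against the slowest assumed background.
    highest = fold_over_background(1e-3, 1e-10)
    print(f"fold excess over background: {lowest:.0e} to {highest:.0e}")
    # Hypothetical observed prevalence of 5e-3 mutations per bp.
    print(f"share attributable to PCR: {pcr_error_share(0.05e-3, 5e-3):.2%}")
```

Under these assumed inputs the PCR contribution is bounded by the ratio of the two prevalences, which is how a "less than 1 in 100" statement can be checked against any particular data set.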


Hypermutation is a part of the natural generation of immunoglobulin variable chain (V)
genes. According to the present invention therefore, the cell line is preferably an
immunoglobulin-producing cell line which is capable of producing at least one
immunoglobulin V gene. A V gene may be a variable light chain (VL) or variable heavy
chain (VH) gene, and may be produced as part of an entire immunoglobulin molecule; it
may be a V gene from an antibody, a T-cell receptor or another member of the
immunoglobulin superfamily. Members of the immunoglobulin superfamily are involved
in many aspects of cellular and non-cellular interactions in vivo, including widespread
roles in the immune system (for example, antibodies, T-cell receptor molecules and the
like), involvement in cell adhesion (for example the ICAM molecules) and intracellular
signalling (for example, receptor molecules, such as the PDGF receptor). Thus, preferred
cell lines according to the invention are derived from B-cells. According to the present
invention, it has been determined that cell lines derived from antibody-producing B cells
may be isolated which retain the ability to hypermutate V region genes, yet do not
hypermutate other genes.

In a preferred embodiment, the cells according to the invention are derived from or related
to cells which hypermutate in vivo. Cells which hypermutate in vivo are, for example,
immunoglobulin-expressing cells, such as B-cells. Lymphoma cells, which are
Ig-expressing cell tumours, are particularly good candidates for the isolation of
constitutively hypermutating cell lines according to the present invention.


As used herein, "screening for ongoing target sequence diversification" refers to the
determination of the presence of hypermutation in the target nucleic acid region of the cell
lines being tested. This can be performed in a variety of ways, including direct
sequencing or indirect methods such as the MutS assay (Jolly et al., 1997) or monitoring
the generation of immunoglobulin loss variants. Cells selected according to this
procedure are cells which display target sequence diversification.

The cell population which is subjected to selection by the method of the invention may be
a polyclonal population, comprising a variety of cell types and/or a variety of target
sequences, or a (mono-) clonal population of cells.

A clonal cell population is a population of cells derived from a single clone, such that the
cells would be identical save for mutations occurring therein. Use of a clonal cell
population preferably excludes co-culturing with other cell types, such as activated
T-cells, with the aim of inducing V gene hypermutation.



Cells according to the invention do not rely on the use of induction steps in order to
produce hypermutation.

Preferably, the clonal cell population screened in the present invention is derived from a B
cell. Advantageously it is a lymphoma cell line, such as a Burkitt lymphoma cell line, a
follicular lymphoma cell line or a diffuse large cell lymphoma cell line.

Preferably, the method according to the invention further comprises the steps of isolating
one or more cells which display target sequence diversification, and comparing the rate of
accumulation of mutations in the target sequences with that in non-target sequences in the
isolated cells.

A feature of the present invention is that the hypermutation is directed only to specific
(target) nucleic acid regions, and is not observed outside of these regions in a general
manner. Specificity is thus assayed as part of the method of the invention by assaying the
rate of mutation of sequences other than target sequences. C region genes, which are not
naturally exposed to hypermutation, may advantageously be employed in such a
technique, although any other nucleic acid region not subject to specific hypermutation
may also be used. Since hypermutation is not sequence dependent, the actual sequence of
the nucleic acid region selected for comparison purposes is not important. However, it
must not be subject to control sequences which direct hypermutation, as described below.
Conveniently, background mutation may be assessed by fluctuation analysis, for example
at the HPRT locus [see Luria and Delbrück, (1943); Capizzi and Jameson, (1973)].
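
One common way of carrying out a fluctuation analysis is the P0 method, in which the mean number of mutational events per culture is estimated from the fraction of parallel cultures that contain no mutants at all. The following is a minimal sketch of that estimator, not a description of the protocol used in the cited references; the function names are hypothetical, and dividing by the final cell number is only an approximation to a rate per cell per generation.

```python
import math


def mutations_per_culture(n_cultures: int, n_without_mutants: int) -> float:
    """Estimate the mean number of mutational events per culture (P0 method).

    If mutations arise as a Poisson process, the chance that a culture has
    no mutant is exp(-m), so m = -ln(P0).
    """
    if not 0 < n_without_mutants <= n_cultures:
        raise ValueError("need at least one mutant-free culture and a valid total")
    p0 = n_without_mutants / n_cultures
    return -math.log(p0)


def approximate_rate(n_cultures: int, n_without_mutants: int, final_cells: float) -> float:
    """Approximate mutation rate per cell per generation.

    Assumption: the number of cell divisions in a culture is close to the
    final cell number, so the rate is taken as m / final_cells.
    """
    if final_cells <= 0:
        raise ValueError("final_cells must be positive")
    return mutations_per_culture(n_cultures, n_without_mutants) / final_cells


if __name__ == "__main__":
    # Hypothetical experiment: 40 parallel cultures, 10 of them mutant-free,
    # each grown to one million cells.
    print(approximate_rate(40, 10, 1e6))
```

The estimator is only informative when some, but not all, cultures are free of mutants, which is why the culture size is usually chosen with the expected rate in mind.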

Cells in which target region mutation exceeds non-target region mutation are cells capable
of directed constitutive hypermutation of a specific nucleic acid region in accordance with
the present invention. The factor by which V region gene mutation exceeds other gene
mutation is variable, but is in general of the order of at least 10^2, advantageously 10^3, and
preferably 10^4 or more.
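
The selection criterion can be expressed as a ratio of mutation prevalences in target and non-target sequence. The sketch below is an assumption-laden illustration: the CloneCounts record, its field names and the threshold argument are hypothetical, both regions are assumed to have been sampled after the same number of generations (so that the ratio of prevalences stands in for the ratio of rates), and a clone with no non-target mutations is scored as if it had one, which gives a conservative lower bound on the ratio.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class CloneCounts:
    """Sequencing tallies for one clone (all fields are hypothetical inputs)."""
    name: str
    target_mutations: int   # substitutions seen in the target (e.g. V) region
    target_bp: int          # total target base pairs sequenced
    control_mutations: int  # substitutions seen in the non-target (e.g. C) region
    control_bp: int         # total non-target base pairs sequenced


def fold_targeting(clone: CloneCounts) -> float:
    """Return target prevalence divided by non-target prevalence."""
    target = clone.target_mutations / clone.target_bp
    # Treat zero observed non-target mutations as one: a conservative bound.
    control = max(clone.control_mutations, 1) / clone.control_bp
    return target / control


def select_directed(clones: Iterable[CloneCounts], min_fold: float = 100.0) -> List[str]:
    """Return the names of clones whose targeting ratio meets the threshold."""
    return [c.name for c in clones if fold_targeting(c) >= min_fold]


if __name__ == "__main__":
    clones = [
        CloneCounts("A", 300, 34_000, 0, 340_000),  # strongly targeted
        CloneCounts("B", 10, 34_000, 5, 34_000),    # general mutator
    ]
    print(select_directed(clones))
```

Because the lower bound depends on how much non-target sequence is read, sequencing more control sequence raises the ratio that a truly targeted clone can demonstrate.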


Overall mutation rates and diversity may be increased, for example by the administration
of mutagens or expression of sequence modifying genes, such as terminal
deoxynucleotidyl transferase (TdT). However, the difference between hypermutation and
background is not expected to be increased in such a manner.

Preferred cells according to the invention may be subject to gene manipulation, such as
gene deletion, conversion or insertion, in order to increase the rate of somatic
hypermutation observed therein. For example, the cells according to the invention may
lack one or more copies of a RAD51 paralogue.

The cells may be any suitable vertebrate cells, including mammalian and avian cells. 


In a second aspect of the present invention, there is provided a method for preparing a
gene product having a desired activity, comprising the steps of:

a) expressing a nucleic acid encoding the gene product in a population of
cells according to the first aspect of the present invention, operably linked to a nucleic
acid which directs hypermutation;

b) identifying a cell or cells within the population of cells which expresses a
mutant gene product having the desired activity; and

c) establishing one or more clonal populations of cells from the cell or cells
identified in step (b), and selecting from said clonal populations a cell or cells which
expresses a gene product having an improved desired activity.
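
Steps a) to c) amount to a repeated cycle of mutation, screening and clonal expansion. The following toy simulation is offered only to make that cycle concrete. It assumes that each cell can be reduced to a single numeric activity score and that hypermutation acts as a small random change to that score in every round; the function evolve and all of its parameters are hypothetical, and the model makes no claim about how real cells behave.

```python
import random
from typing import List, Sequence


def evolve(population: Sequence[float], rounds: int, keep_fraction: float = 0.02,
           mutation_sd: float = 0.1, seed: int = 0) -> List[float]:
    """Run a toy mutate/select/expand cycle and return the mean selected score per round."""
    if not population or rounds < 0 or not 0 < keep_fraction <= 1:
        raise ValueError("invalid arguments")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    size = len(population)
    cells = list(population)
    history = []
    for _ in range(rounds):
        # Ongoing mutation: every cell's score drifts by a random amount.
        cells = [score + rng.gauss(0.0, mutation_sd) for score in cells]
        # Step b): identify the cells showing the highest activity.
        cells.sort(reverse=True)
        n_keep = max(1, int(size * keep_fraction))
        selected = cells[:n_keep]
        history.append(sum(selected) / n_keep)
        # Step c): expand the selected cells back to the original population size.
        cells = [selected[i % n_keep] for i in range(size)]
    return history


if __name__ == "__main__":
    for round_no, mean_score in enumerate(evolve([0.0] * 500, rounds=5), start=1):
        print(round_no, round(mean_score, 3))
```

The point of the sketch is the structure of the loop: because diversification continues between rounds, each expansion supplies new variants for the next selection without any separate library-construction step.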

The population of cells according to part a) above is derived from a clonal or polyclonal
population of cells which comprises cells identified by a method according to the first
aspect of the invention as being capable of constitutive hypermutation of V region genes.
The gene product may thus be the endogenous immunoglobulin polypeptide, a gene
product expressed by a manipulated endogenous gene or a gene product expressed by a
heterologous transcription unit operatively linked to control sequences which direct
somatic hypermutation, as described further below.

The nucleic acid which is expressed in the cells of the invention and subjected to
hypermutation may be an endogenous region, such as the endogenous V region, or a
heterologous region inserted into the cell line of the invention. This may take the form, for
example, of a replacement of the endogenous V region with heterologous transcription
unit(s), such as a heterologous V region, retaining the endogenous control sequences
which direct hypermutation; or of the insertion into the cell of a heterologous transcription
unit under the control of its own control sequences to direct hypermutation, wherein the
transcription unit may encode V region genes or any other desired gene product. The
nucleic acid according to the invention is described in more detail below.

In step b) above, the cells are screened for the desired gene product activity. This may be,
for example in the case of immunoglobulins, a binding activity. Other activities may also
be assessed, such as enzymatic activities or the like, using appropriate assay procedures.
Where the gene product is displayed on the surface of the cell, cells which produce the
desired activity may be isolated by detection of the activity on the cell surface, for
example by fluorescence, or by immobilising the cell to a substrate via the surface gene
product. Where the activity is secreted into the growth medium, or otherwise assessable
only for the entire cell culture as opposed to in each individual cell, it is advantageous to
establish a plurality of clonal populations from step a) in order to increase the probability
of identifying a cell which secretes a gene product having the desired activity.
Advantageously, the selection system employed does not affect the cell's ability to
proliferate and mutate.


Preferably, at this stage (and in step c)) cells which express gene products having a better,
improved or more desirable activity are selected. Such an activity is, for example, a
higher affinity binding for a given ligand, or a more effective enzymatic activity. Thus,
the method allows for selection of cells on the basis of a qualitative and/or quantitative
assessment of the desired activity.

In a third aspect of the present invention, there is provided the use of a cell capable of 
directed constitutive hypermutation of a specific nucleic acid region in the preparation of 
a gene product having a desired activity. 


In the use according to the invention, a nucleic acid encoding the gene product having the
desired activity is operatively linked to control sequences which direct hypermutation
within the cell. Successive generations of the cell thus produce mutants of the nucleic
acid sequence, which are screened by the method of the invention to isolate mutants with
advantageous properties.

WO 02/100998 PCT/GB02/02688

In a further aspect, the invention relates to a cell capable of directed constitutive
hypermutation in accordance with the invention. Preferably, the cell is a genetically
manipulated chicken DT40 cell. As described above, one or more DNA-repair genes may
be manipulated. Preferably, one or more Rad51 genes are manipulated. Advantageously,
the genes are downregulated or deleted. Preferably, the genes are Rad51b or Rad51c
genes.

In a highly preferred embodiment, the invention provides a cell selected from the group
consisting of Δxrcc2 DT40 and Δxrcc3 DT40.


Brief Description of the Figures 

Figure 1. VH diversity in Burkitt lines.

(A) Sequence diversity in the rearranged VH genes of four sporadic Burkitt
lymphoma lines, shown as pie charts. The number of M13 clones sequenced for each cell
line is denoted in the centre of the pie; the sizes of the various segments depict the
proportion of sequences that are distinguished by 0, 1, 2 etc. mutations (as indicated) from
the consensus.

(B) Presumed dynastic relationship of VH mutations identified in the initial
Ramos culture. Each circle (with shading proportional to extent of mutation) represents a
distinct sequence with the number of mutations accumulated indicated within the circle.

(C) Mutation prevalence in the rearranged Vλ genes. Two Vλ rearrangements
are identified in Ramos. Diversity and assignment of germline origin is presented as in
Figure 1A.

(D) Comparison of mutation prevalence in the VH and Cμ regions of the initial
Ramos culture. Pie charts are presented as in Figure 1A.

Figure 2. Constitutive VH diversification in Ramos.

(A) Diversification assessed by a MutS assay. The mutation prevalence in each
population as deduced by direct cloning and sequencing is indicated.

(B) Dynastic relationships deduced from the progeny of three independent
Ramos clones.

Figure 3. Distribution of unselected nucleotide substitutions along the Ramos VH.

Figure 4. Hypermutation in Ramos generates diverse revertible IgM-loss variants.

(A) Scheme showing the isolation of IgM-loss variants.

(B) Table showing that multiple nonsense mutations can contribute to VH
inactivation. Each VH codon position at which stops are observed in these two
populations is listed.

(C) Table of reversion rates of IgM-loss variants.

(D) Sequence surrounding the stop codons in the IgM-loss derivatives.

Figure 5. IgM-loss variants in Ramos transfectants expressing TdT.

(A) Western blot analysis of expression of TdT in three pSV-ppG/TdT and
three control transfectants of Ramos.

(B) Pie charts depicting independent mutational events giving rise to IgM-loss
variants.

Figure 6. Sequence table summarising mutations in VH other than single nucleotide
substitutions.

Figure 7. Comparison of sequences isolated from VH genes of Ramos cells which
have lost anti-idiotype (anti-Id1) binding specificity. Nucleotide substitutions which
differ from the starting population consensus are shown in bold. Predicted amino acid
changes are indicated, also in bold type.

Figure 8. Bar graph showing enrichment of Ramos cells for production of an
immunoglobulin with a novel binding specificity, by iterative selection over five rounds.

Figure 9. Bar graph showing improved recovery of Ramos cells binding a novel
specificity (streptavidin) by increasing the bead:cell ratio.

Figure 10. Chart showing increase in recovery of novel binding specificity Ramos 
cells according to increasing target antigen concentration. 

Figure 11. VH sequence derived from streptavidin-binding Ramos cells. Nucleotide
changes observed in comparison with the VH sequence of the starting population, and
predicted amino acid changes, are shown in bold.

Figure 12. Amount of IgM in supernatants of cells selected in rounds 4, 6 and 7 of a
selection process for streptavidin binding, against control medium and unselected Ramos
cell supernatant.

Figure 13. Streptavidin binding of IgM from the supernatants of Figure 12.

Figure 14. Streptavidin binding of supernatants from round 4 and round 6 of a
selection for streptavidin binding, analysed by surface plasmon resonance.

Figure 15. FACS analysis of binding to streptavidin-FITC of cells selected in rounds
4 and 6.


Figure 16. VH and VL sequences of round 6 selected IgM.

Figure 17. FACS analysis of affinity matured Ramos cells selected against 
streptavidin. 


Figure 18. ELISA of affinity matured Ramos cells. 

Figure 19. sIgM-loss variants in wild-type and repair deficient DT40.

(A) Flow cytometric analysis of sIgM heterogeneity in wild-type and repair
deficient cells.

(B) Fluctuation analysis of the frequency of generation of sIgM-loss variants.


Figure 20. Analysis of sequences cloned from sIgM variants of DT40.

Figure 21. Analysis of Ig sequences of unsorted DT40 populations after one month of
clonal expansion.


Figure 22. Analysis of sIgM loss variants of DT40 cells deficient in DNA-PK, Ku70
and Rad51B.

Figure 23. A dynasty of IgMs from DT40 specific for a rat immunoglobulin idiotype.
Cells were subjected to six rounds of selection using an aggregate of biotinylated-rat S7
IgG2a,κ monoclonal antibody (Ab)/FITC-streptavidin to yield DT-Ab3, which was then
subjected to a further three rounds of sorting with a direct PE-Ab conjugate to yield
DT-Ab6. (a) Analysis of the binding of biotinylated Ab/FITC-Strep aggregate as well as of
PE-Ab to DT-Ab3 and DT-Ab6 cells. (b) Binding of IgM in the supernatant of DT-Ab3, 5
and 6 cells to the S7 Ab monitored by ELISA using plates coated with S7 Ab and
detection with anti-chicken IgM. Supernatants were controlled for total IgM titres. No
binding by the DT-Ab6 IgM was detected to a wide variety of rat hybridoma and
chimaeric mAbs of different isotypes (generously provided by G. Butcher and M.
Brüggemann (not shown)). (c) Binding of DT-Ab6 IgM to S7 rat IgG Ab coated on to the
plate is competed by S7 Ab itself but scarcely by normal rat serum. The S7 (stock at 0.5
mg/ml) and rat serum competitors were used at the dilutions indicated. (d) DT-Ab6 IgM
can be used to stain S7 hybridoma cells. Staining was detected by flow cytometry of
fixed, permeabilised cells, detecting using FITC-conjugated anti-chicken IgM. (e)
Comparison of the staining of DT-Ab3 and DT-Ab6 cells by PE-Ab conjugate using
various dilutions of the original 0.2 mg/ml stock (PE-S7 Ab; Pharmacia). (f) Affinity
determination. DT-Ab6 cells (10^?) were incubated with 0.2 picomoles of PE-conjugated
S7 Ab in various volumes to give the PE-Ab concentrations indicated; the mean
fluorescence intensity (MFI) of the cells was determined following washing. (g) Comparison
of the IgVH/VL sequences of DT-Ab6 and the parental population.

Figure 24. A dynasty of Protein A-specific IgMs from DT40. (a) Selection of DT40
variants binding to derivatised magnetic beads showing the number of cells recovered
following incubation of 10^? cells with 10^6 beads. DT-P1 and P2 were selected on
streptavidin-beads coated with biotinylated-Protein A whereas DT-P3 was selected from
DT-P2 on tosylated-magnetic beads directly coated with Protein A. (b) Comparison of
staining of subclones of various DT-P cells by an aggregate of biotinylated Protein A with
FITC-streptavidin, by Protein A that had been directly conjugated with FITC and by
FITC-anti-IgM. (c) Binding of IgM in the supernatant of parental DT40 and DT-P14 bulk
population to Protein A-coated plates monitored by ELISA using a mouse IgM mAb
specific for chicken IgM (Southern Biotechnology) for detection. Total IgM titres were
also compared as a control. (d) IgM from the supernatant of DT-P4, -P6 and P7 clones
(but not from parental DT40) can be purified on Protein A, using Western blot to detect
chicken IgM retained on the Protein A-Sepharose. (e) Binding of [35S]Protein A to DT-P
cells. Parental DT40 (DT) or DT-P cells (5 x 10^5 cells in 0.1 ml) were incubated on ice for
1 h with [35S]Protein A (48,000 bindable cpm; 200-2000 Ci/mmol); bound cpm were
determined following a single PBS wash. (f) Affinity determination. DT-P14 cells (10^6)
were incubated with 0.7 picomoles [35S]Protein A (7 x 10^10 bindable cpm/μmole) in various
volumes to yield the Protein A concentrations indicated; bound cpm were determined
following washing. (g) Staining of DT-P4 and DT-P14 cells with FITC-Protein A is
enhanced by performing the staining in the presence (P4+, P14+) of 1 mg/ml unlabelled
rabbit IgG. (h) IgV gene sequences from DT-P subclones.


Figure 25. Antigen-selection of DT40 can yield polyreactive IgMs with the possibility
of maturation into specificity. DT-H10 cells were obtained by ten sequential rounds of
selection from the parental Xrcc2-deficient DT40 pool using carboxylated magnetic beads
to which human serum albumin (HSA) had been coupled. DT-T10 and DT-O6 cells were
obtained by serial rounds of flow cytometric selection of cells stained with aggregates of
FITC-streptavidin with either biotinylated thyroglobulin (DT-T cells) or ovalbumin (DT-O
cells), taking the brightest 2% of the population in each round. (a) Flow cytometric
analysis of DT-H6 and DT-T6 cells stained with the reagents indicated (Cy-strep,
cychrome-streptavidin). (b) Antigen binding by DT-H6 and DT-T6 cells is mediated by
the surface IgM as witnessed by 2D flow cytometric analysis of DT-H6 and DT-T6
subpopulations that have been enriched for sIgM-loss variants. (c) ELISA of DT-H6 and
DT-T6 culture supernatants, monitoring chicken IgM specific for the antigens indicated.
(d) DT-O6 cells (selected with ovalbumin aggregates) also bind Tg (as well as several
other antigens tested (not shown)) but sequential sorting for FITC-Ova(bright)/
Cy-streptavidin(dull) cells yields a population exhibiting greater specificity for Ova.

Figure 26. Analysis of naturally-occurring constitutively hypermutating BL cell lines.



Detailed Description of the Invention 

The present invention makes available for the first time a cell line which constitutively
hypermutates selected nucleic acid regions. This permits the design of systems which
produce mutated gene products by a technique which mirrors affinity maturation in
natural antibody production. The Ramos Burkitt line constitutively diversifies its
rearranged immunoglobulin V gene during in vitro culture. This hypermutation does not
require stimulation by activated T cells, exogenously-added cytokines or even
maintenance of the B cell antigen receptor.

The rate of mutation (which lies in the range 0.2-1 x 10^-4 bp^-1 generation^-1) is sufficiently
high to readily allow the accumulation of a large database of unselected mutations and so
reveal that hypermutation in Ramos exhibits most of the features classically associated
with immunoglobulin V gene hypermutation in vivo (preferential targeting of mutation to
the V; stepwise accumulation of single nucleotide substitutions; transition bias;
characteristic mutational hotspots). The large majority of mutations in the unselected
database are single nucleotide substitutions although deletions and duplications
(sometimes with a flanking nucleotide substitution) are detectable. Such deletions and
duplications have also been proposed to be generated as a consequence of hypermutation
in vivo (Wilson et al., 1998; Goossens et al., 1998; Wu & Kaartinen, 1995).
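
To give a feel for what a rate of this order means in practice, the expected number of substitutions per V gene copy can be approximated as rate x length x generations, and the spread across sequenced clones can be approximated by a Poisson distribution. The sketch below is a simplification made for illustration: it assumes that mutations are independent and uniformly distributed, ignoring hotspots and any selection, and the V gene length of 340 bp and the 100 generations used in the example are hypothetical inputs.

```python
import math


def expected_mutations(rate_per_bp_per_gen: float, length_bp: int, generations: int) -> float:
    """Return the expected number of substitutions per gene copy."""
    if rate_per_bp_per_gen < 0 or length_bp < 0 or generations < 0:
        raise ValueError("inputs must be non-negative")
    return rate_per_bp_per_gen * length_bp * generations


def poisson_pmf(k: int, mean: float) -> float:
    """Return the Poisson probability of observing exactly k mutations."""
    if k < 0 or mean < 0:
        raise ValueError("k and mean must be non-negative")
    return math.exp(-mean) * mean ** k / math.factorial(k)


if __name__ == "__main__":
    # Hypothetical inputs: 1e-4 per bp per generation, 340 bp, 100 generations.
    mean = expected_mutations(1e-4, 340, 100)
    print(f"expected substitutions per sequence: {mean:.2f}")
    for k in range(5):
        print(k, f"{poisson_pmf(k, mean):.3f}")
```

The per-class probabilities correspond to the segments of a pie chart showing the proportion of sequences carrying 0, 1, 2 and so on mutations, which is the form in which the diversity data are presented in the Figures.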

The isolation of cells which constitutively hypermutate selected nucleic acid regions is
based on the monitoring of V gene mutation in cell lines derived from antibody-producing
cells such as B cells. The selection method employed in the invention may be configured
in a number of ways.

Selection of Hypermutating Cells

Hypermutating cells may be selected from a population of cells by a variety of techniques,
including sequencing of target sequences, selection for expression loss mutants, assay
using bacterial MutS protein and selection for change in gene product activity.

One of the features of hypermutation of target nucleic acids is that the process results in
the introduction of stop codons into the target sequence with far greater frequency than
would be observed in the absence of hypermutation. This results in loss of production of
a gene product from the cell. This loss may be exploited to identify cells which are
hypermutating nucleic acid sequences.
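The link between single nucleotide substitutions and expression loss can be illustrated
with a minimal Python sketch. It lists, for an in-frame coding sequence, every single-base
change that would convert a sense codon into a stop codon. The function name and the
use of the standard genetic code stop codons (TAA, TAG, TGA) are assumptions made
for illustration; this is not the selection procedure of the invention itself.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet


def stop_creating_substitutions(coding_seq):
    """Return (position, new_base) pairs, 1-based, for every single-base
    substitution that turns a sense codon into a stop codon.

    Assumes coding_seq starts in frame; trailing bases that do not fill a
    complete codon are ignored.
    """
    seq = coding_seq.upper()
    hits = []
    for start in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[start:start + 3]
        if codon in STOP_CODONS:
            continue  # already a stop codon, nothing to create
        for offset in range(3):
            for base in "ACGT":
                if base == codon[offset]:
                    continue
                mutant = codon[:offset] + base + codon[offset + 1:]
                if mutant in STOP_CODONS:
                    hits.append((start + offset + 1, base))
    return hits
```

For a single TGG (Trp) codon the sketch reports two stop-creating changes (to TAG and
to TGA), whereas a GGG (Gly) codon offers none.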

In a preferred embodiment of the invention, the target nucleic acid encodes an
immunoglobulin. Immunoglobulin loss may be detected both for cells which secrete
immunoglobulins into the culture medium, and for cells in which the immunoglobulin is
displayed on the cell surface. Where the immunoglobulin is present on the cell surface,
its absence may be identified for individual cells, for example by FACS analysis,
immunofluorescence microscopy or ligand immobilisation to a support. In a preferred
embodiment, cells may be mixed with antigen-coated magnetic beads which, when
sedimented, will remove from the cell suspension all cells having an immunoglobulin of
the desired specificity displayed on the surface.

The technique may be extended to any immunoglobulin molecule, including antibodies,
T-cell receptors and the like. The selection of immunoglobulin molecules will depend on
the nature of the clonal population of cells which it is desired to assay according to the
invention.

Alternatively, cells according to the invention may be selected by sequencing of target
nucleic acids, such as V genes, and detection of mutations by sequence comparison. This
process may be automated in order to increase throughput.
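As a hedged illustration of how such a sequence comparison might be automated, the
following Python sketch compares a clone sequence with a reference and reports each
substitution as the novel base followed by its position, in the style of the "C230" labels
used in Example 1. The function name is hypothetical, and the sketch assumes the two
sequences are already aligned and of equal length, with positions counted from 1 at the
start of the supplied reference; deletions and duplications would need a true alignment
step.

```python
def find_substitutions(reference, clone):
    """List single nucleotide substitutions in clone relative to reference.

    Each substitution is reported as '<novel base><1-based position>'.
    Both sequences must be pre-aligned and of equal length (an assumption
    of this sketch).
    """
    if len(reference) != len(clone):
        raise ValueError("sequences must be aligned to the same length")
    labels = []
    pairs = zip(reference.upper(), clone.upper())
    for position, (ref_base, clone_base) in enumerate(pairs, start=1):
        if ref_base != clone_base:
            labels.append(f"{clone_base}{position}")
    return labels
```

Applied to every sequenced clone in turn, the resulting label lists can be pooled to give
the number of distinct substitutions observed in the population.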

In a further embodiment, cells which hypermutate V genes may be detected by assessing
change in antigen binding activity in the immunoglobulins produced in a clonal cell
population. For example, the quantity of antigen bound by a specific unit amount of cell
medium or extract may be assessed in order to determine the proportion of
immunoglobulin produced by the cell which retains a specified binding activity. As the V
genes are mutated, so binding activity will be varied and the proportion of produced
immunoglobulin which binds a specified antigen will be reduced.

Alternatively, cells may be assessed in a similar manner for the ability to develop a novel
binding affinity, such as by exposing them to an antigen or mixture of antigens which are
initially not bound and observing whether a binding affinity develops as the result of
hypermutation.

In a further embodiment, the bacterial MutS assay may be used to detect sequence
variation in target nucleic acids. The MutS protein binds to mismatches in nucleic acid
hybrids. By creating heteroduplexes between parental nucleic acids and those of
potentially mutated progeny, the extent of mismatch formation, and thus the extent of
nucleic acid mutation, can be assessed.

Where the target nucleic acid encodes a gene product other than an immunoglobulin,
selection may be performed by screening for loss or alteration of a function other than
binding. For example, the loss or alteration of an enzymatic activity may be screened for.

Cells which display hypermutation of the target sequence are then assessed for mutation
in other nucleic acid regions. A convenient region to assay is the constant (C) region of
an immunoglobulin gene. C regions are not subject to directed hypermutation according
to the invention. The assessment of C regions is preferably made by sequencing and
comparison, since this is the most certain method for determining the absence of
mutations. However, other techniques may be employed, such as monitoring for the
retention of C region activities, for example complement fixation, which may be disrupted
by hypermutation events.

Genetic Manipulation of cells

Hypermutating cells according to the invention may be selected from cells which have
been genetically manipulated to enhance rates of hypermutation in the Ig V-region. Genes
which are responsible for modulation of mutation rates include, in general, those involved
in nucleic acid repair procedures in the cell. Genes which are manipulated in accordance
with the present invention may be upregulated, downregulated or deleted.

Up- or down-regulation refers to an increase, or decrease, in activity of the gene product
encoded by the gene in question by at least 10%, preferably 25%, more preferably 40, 50,
60, 70, 80, 90, 95, 99% or more. Upregulation may of course represent an increase in
activity of over 100%, such as 200% or 500%. A gene which is 100% downregulated is
functionally deleted and is referred to herein as "deleted".
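The thresholds in this definition can be expressed as a short Python sketch. The function
name, the idea of comparing a measured activity against a baseline activity, and the
"unchanged" label for changes below the 10% minimum are assumptions introduced for
illustration only.

```python
def classify_regulation(baseline_activity, modified_activity, threshold_pct=10.0):
    """Classify a change in gene product activity using the definition above.

    threshold_pct is the minimum percentage change (10% in the text) that
    counts as up- or down-regulation; a 100% decrease is reported as
    'deleted'.
    """
    if baseline_activity <= 0:
        raise ValueError("baseline activity must be positive")
    change_pct = 100.0 * (modified_activity - baseline_activity) / baseline_activity
    if change_pct <= -100.0:
        return "deleted"
    if change_pct >= threshold_pct:
        return "upregulated"
    if change_pct <= -threshold_pct:
        return "downregulated"
    return "unchanged"
```

Raising threshold_pct to 25 or 40 gives the stricter, more preferred definitions mentioned
in the text.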

Preferred genes manipulated in accordance with the present invention include analogues
and/or paralogues of the Rad51 gene, in particular xrcc2, xrcc3 and Rad51b genes.

Rad51 analogues and/or paralogues are advantageously downregulated, and preferably
deleted. Downregulation or deletion of one or more Rad51 paralogues gives rise to an
increase in hypermutation rates in accordance with the invention. Preferably, two or more
Rad51 genes, including analogues and/or paralogues thereof, are downregulated or
deleted.

In a highly preferred embodiment, avian cell lines such as the chicken DT40 cell line are
modified by deletion of xrcc2 and/or xrcc3. Δxrcc2-DT40 as well as Δxrcc3-DT40 are
constitutively hypermutating cell lines isolated in accordance with the present invention.

Adaptation of the endogenous gene products

Having obtained a cell line which constitutively hypermutates an endogenous gene, such
as an immunoglobulin V region gene, the present invention provides for the adaptation of
the endogenous gene product, by constitutive hypermutation, to produce a gene product
having novel properties. For example, the present invention provides for the production
of an immunoglobulin having a novel binding specificity or an altered binding affinity.

The process of hypermutation is employed, in nature, to generate improved or novel
binding specificities in immunoglobulin molecules. Thus, by selecting cells according to
the invention which produce immunoglobulins capable of binding to the desired antigen
and then propagating these cells in order to allow the generation of further mutants, cells
which express immunoglobulins having improved binding to the desired antigen may be
isolated.

A variety of selection procedures may be applied for the isolation of mutants having a
desired specificity. These include Fluorescence Activated Cell Sorting (FACS), cell
separation using magnetic particles, antigen chromatography methods and other cell
separation techniques such as use of polystyrene beads.

Separating cells using magnetic capture may be accomplished by conjugating the antigen
of interest to magnetic particles or beads. For example, the antigen may be conjugated to
superparamagnetic iron-dextran particles or beads as supplied by Miltenyi Biotec GmbH.
These conjugated particles or beads are then mixed with a cell population which may
express a diversity of surface immunoglobulins. If a particular cell expresses an
immunoglobulin capable of binding the antigen, it will become complexed with the
magnetic beads by virtue of this interaction. A magnetic field is then applied to the
suspension which immobilises the magnetic particles, and retains any cells which are
associated with them via the covalently linked antigen. Unbound cells which do not
become linked to the beads are then washed away, leaving a population of cells which is
isolated purely on its ability to bind the antigen of interest. Reagents and kits are
available from various sources for performing such one-step isolations, and include Dynal
Beads (Dynal AS; http://www.dynal.no), MACS-Magnetic Cell Sorting (Miltenyi Biotec
GmbH; http://www.miltenyibiotec.com), CliniMACS (AmCell; http://www.amcell.com)
as well as Biomag, Amerlex-M beads and others.

Fluorescence Activated Cell Sorting (FACS) can be used to isolate cells on the basis of
their differing surface molecules, for example surface displayed immunoglobulins. Cells
in the sample or population to be sorted are stained with specific fluorescent reagents
which bind to the cell surface molecules. These reagents would be the antigen(s) of
interest linked (either directly or indirectly) to fluorescent markers such as fluorescein,
Texas Red, malachite green, green fluorescent protein (GFP), or any other fluorophore
known to those skilled in the art. The cell population is then introduced into the vibrating
flow chamber of the FACS machine. The cell stream passing out of the chamber is
encased in a sheath of buffer fluid such as PBS (Phosphate Buffered Saline). The stream
is illuminated by laser light and each cell is measured for fluorescence, indicating binding
of the fluorescent labelled antigen. The vibration in the cell stream causes it to break up
into droplets, which carry a small electrical charge. These droplets can be steered by
electric deflection plates under computer control to collect different cell populations
according to their affinity for the fluorescent labelled antigen. In this manner, cell
populations which exhibit different affinities for the antigen(s) of interest can be easily
separated from those cells which do not bind the antigen. FACS machines and reagents
for use in FACS are widely available from sources world-wide such as Becton-Dickinson,
or from service providers such as Arizona Research Laboratories
(http://www.arl.arizona.edu/facs/).

Another method which can be used to separate populations of cells according to the
affinity of their cell surface protein(s) for a particular antigen is affinity chromatography.
In this method, a suitable resin (for example CL-600 Sepharose, Pharmacia Inc.) is
covalently linked to the appropriate antigen. This resin is packed into a column, and the
mixed population of cells is passed over the column. After a suitable period of incubation
(for example 20 minutes), unbound cells are washed away using (for example) PBS
buffer. This leaves only that subset of cells expressing immunoglobulins which bound the
antigen(s) of interest, and these cells are then eluted from the column using (for example)
an excess of the antigen of interest, or by enzymatically or chemically cleaving the antigen
from the resin. This may be done using a specific protease such as factor X, thrombin, or
other specific protease known to those skilled in the art to cleave the antigen from the
column via an appropriate cleavage site which has previously been incorporated into the
antigen-resin complex. Alternatively, a non-specific protease, for example trypsin, may
be employed to remove the antigen from the resin, thereby releasing that population of
cells which exhibited affinity for the antigen of interest.



Insertion of heterologous transcription units

In order to maximise the chances of quickly selecting an antibody variant capable of
binding to any given antigen, or to exploit the hypermutation system for
non-immunoglobulin genes, a number of techniques may be employed to engineer cells
according to the invention such that their hypermutating abilities may be exploited.

In a first embodiment, transgenes are transfected into a cell according to the invention 
such that the transgenes become targets for the directed hypermutation events. 

As used herein, a "transgene" is a nucleic acid molecule which is inserted into a cell, such
as by transfection or transduction. For example, a "transgene" may comprise a
heterologous transcription unit as referred to above, which may be inserted into the
genome of a cell at a desired location.

The plasmids used for delivering the transgene to the cells are of conventional
construction and comprise a coding sequence, encoding the desired gene product, under
the control of a promoter. Gene transcription from vectors in cells according to the
invention may be controlled by promoters derived from the genomes of viruses such as
polyoma virus, adenovirus, fowlpox virus, bovine papilloma virus, avian sarcoma virus,
cytomegalovirus (CMV), a retrovirus and Simian Virus 40 (SV40), from heterologous
mammalian promoters such as the actin promoter or a very strong promoter, e.g. a
ribosomal protein promoter, and from the promoter normally associated with the
heterologous coding sequence, provided such promoters are compatible with the host
system of the invention.

Transcription of a heterologous coding sequence by cells according to the invention may
be increased by inserting an enhancer sequence into the vector. Enhancers are relatively
orientation and position independent. Many enhancer sequences are known from
mammalian genes (e.g. elastase and globin). However, typically one will employ an
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late
side of the replication origin (bp 100-270) and the CMV early promoter enhancer. The
enhancer may be spliced into the vector at a position 5' or 3' to the coding sequence, but is
preferably located at a site 5' from the promoter.

Advantageously, a eukaryotic expression vector may comprise a locus control region
(LCR). LCRs are capable of directing high-level integration site independent expression
of transgenes integrated into host cell chromatin, which is of importance especially where
the heterologous coding sequence is to be expressed in the context of a
permanently-transfected eukaryotic cell line in which chromosomal integration of the
vector has occurred, in vectors designed for gene therapy applications or in transgenic
animals.

Eukaryotic expression vectors will also contain sequences necessary for the termination of
transcription and for stabilising the mRNA. Such sequences are commonly available from
the 5' and 3' untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions
contain nucleotide segments transcribed as polyadenylated fragments in the untranslated
portion of the mRNA.

An expression vector includes any vector capable of expressing a coding sequence
encoding a desired gene product that is operatively linked with regulatory sequences, such
as promoter regions, that are capable of expression of such DNAs. Thus, an expression
vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage,
recombinant virus or other vector, that upon introduction into an appropriate host cell,
results in expression of the cloned DNA. Appropriate expression vectors are well known
to those with ordinary skill in the art and include those that are replicable in eukaryotic
and/or prokaryotic cells and those that remain episomal or those which integrate into the
host cell genome. For example, DNAs encoding a heterologous coding sequence may be
inserted into a vector suitable for expression of cDNAs in mammalian cells, e.g. a CMV
enhancer-based vector such as pEVRF (Matthias, et al., 1989).

Construction of vectors according to the invention employs conventional ligation
techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the
form desired to generate the plasmids required. If desired, analysis to confirm correct
sequences in the constructed plasmids is performed in a known fashion. Suitable methods
for constructing expression vectors, preparing in vitro transcripts, introducing DNA into
host cells, and performing analyses for assessing gene product expression and function are
known to those skilled in the art. Gene presence, amplification and/or expression may be
measured in a sample directly, for example, by conventional Southern blotting, Northern
blotting to quantitate the transcription of mRNA, dot blotting (DNA or RNA analysis), or
in situ hybridisation, using an appropriately labelled probe which may be based on a
sequence provided herein. Those skilled in the art will readily envisage how these
methods may be modified, if desired.

In one variation of the first embodiment, transgenes according to the invention also
comprise sequences which direct hypermutation. Such sequences have been
characterised, and include those sequences set forth in Klix et al. (1998), and Sharpe et
al. (1991), incorporated herein by reference. Thus, an entire locus capable of expressing
a gene product and directing hypermutation to the transcription unit encoding the gene
product is transferred into the cells. The transcription unit and the sequences which direct
hypermutation are thus exogenous to the cell. However, although exogenous, the
sequences which direct hypermutation themselves may be similar or identical to the
sequences which direct hypermutation naturally found in the cell.

In a second embodiment, the endogenous V gene(s) or segments thereof may be replaced
with heterologous V gene(s) by homologous recombination, or by gene targeting using,
for example, a Lox/Cre system or an analogous technology or by insertion into
hypermutating cell lines which have spontaneously deleted endogenous V genes.
Alternatively, V region gene(s) may be replaced by exploiting the observation that
hypermutation is accompanied by double stranded breaks in the vicinity of rearranged V
genes.

The invention is further described below, for the purposes of illustration only, in the
following examples.

Example 1: Selection of a hypermutating cell 

In order to screen for a cell that undergoes hypermutation in vitro, the extent of diversity
that accumulates in several human Burkitt lymphomas during clonal expansion is
assessed. The Burkitt lines BL2, BL41 and BL70 are kindly provided by G. Lenoir
(IARC, Lyon, France) and Ramos (Klein et al., 1975) is provided by D. Fearon
(Cambridge, UK). Their rearranged Vh genes are PCR amplified from genomic DNA
using multiple Vh family primers together with a Jh consensus oligonucleotide.
Amplification of rearranged Vh segments is accomplished using Pfu polymerase together
with one of 14 primers designed for each of the major human Vh families (Tomlinson,
1997) and a consensus Jh back primer which anneals to all six human Jh segments
(JOL48, 5'-GCGGTACCTGAGGAGACGGTGACC-3', gift of C. Jolly). Amplification
of the Ramos Vh from genomic DNA is performed with oligonucleotides RVHFOR (5'-
CCCCAAGCTTCCCAGGTGCAGCTACAGCAG) and JOL48. Amplification of the
expressed Vh-Cμ cDNA is performed using RVHFOR and Cμ2BACK (5'-
CCCCGGTACCAGATGAGCTTGGACTTGCGG). The genomic Cμ1/2 region is
amplified using Cμ2BACK with Cμ1FOR (5'-
CCCCAAGCTTCGGGAGTGCATCCGCCCCAACCCTT); the functional Cμ allele of
Ramos contains a C at nucleotide 8 of Cμ2 as opposed to T on the non-functional allele.
Rearranged Vλs are amplified using 5'-CCCCAAGCTTCCCAGTCTGCCCTGACTCAG
and 5'-CCCCTCTAGACCACCTAGGACGGTCAGCTT. PCR products are purified
using QIAquick (Qiagen) spin columns and sequenced using an ABI377 sequencer

following cloning into M13. Mutations are computed using the GAP4 alignment program
(Bonfield et al., 1995).

Sequencing of the cloned PCR products reveals considerable diversity in the Ramos cell
line (a prevalence of 2.8 x 10^-3 mutations bp^-1 in the Vh) although significant
heterogeneity is also observed in BL41 as well as in BL2. See Figure 1A. Sequence
diversity in the rearranged Vh genes of four sporadic Burkitt lymphoma lines is shown
as pie charts. The rearranged Vh genes in each cell line are PCR amplified and cloned
into M13. For each cell line, the consensus is taken as the sequence common to the
greatest number of M13 clones and a germline counterpart (indicated above each pie)
assigned on the basis of closest match using the VBASE database of human
immunoglobulin sequences (Tomlinson, 1997). The Vh consensus sequence for Ramos
used herein differs in 3 positions from the sequence determined by Chapman et al. (1996),
five positions from that determined by Ratech (1992) and six positions from its closest
germline counterpart Vh4(DP-63).
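The consensus rule stated above (the sequence common to the greatest number of M13
clones) can be sketched in a few lines of Python. The function name is hypothetical, and
the sketch assumes that every clone covers the same stretch of Vh so that identical
inserts compare equal as strings; ties are resolved in favour of the sequence seen first.

```python
from collections import Counter


def consensus_by_majority(clone_sequences):
    """Return (sequence, clone_count) for the most frequent clone sequence."""
    if not clone_sequences:
        raise ValueError("at least one clone sequence is required")
    counts = Counter(seq.upper() for seq in clone_sequences)
    # most_common(1) yields the single most frequent (sequence, count) pair
    return counts.most_common(1)[0]
```

Clones that differ from the returned sequence are then the candidates for mutated
variants within that cell line.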

The analysis of Vh diversity in Ramos is extended by sequencing the products from nine
independent PCR amplifications. This enables a likely dynastic relationship between the
mutated clones in the population to be deduced, minimising the number of presumed
independent repeats of individual nucleotide substitutions (Figure 1B). 315 M13VH
clones obtained from nine independent PCR amplifications are sequenced; the dynasty
only includes sequences identified (rather than presumed intermediates). Individual
mutations are designated according to the format "C230" with 230 being the nucleotide
position in the Ramos Vh (numbered as in Figure 3) and the "C" indicating the novel base
at that position. The criterion used to deduce the genealogy is a minimisation of the
number of independent occurrences of the same nucleotide substitution. The majority of
branches contain individual members contributed by distinct PCR amplifications. The rare
deletions and duplications are indicated by the prefix "x" and "d" respectively. Arrows
highlight two mutations (a substitution at position 264 yielding a stop codon and a
duplication at position 184) whose position within the tree implies that mutations can
continue to accumulate following loss of functional heavy chain expression.
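The labelling convention described above lends itself to a small parser, sketched here in
Python. The function name is hypothetical, and the sketch assumes that a deletion or
duplication label consists of the prefix "x" or "d" followed directly by a nucleotide
position; the text specifies the prefixes but not the remainder of those labels.

```python
import re

# novel base (A, C, G or T) or event prefix (x = deletion, d = duplication),
# followed by the nucleotide position
_LABEL = re.compile(r"^([ACGTxd])(\d+)$")


def parse_mutation_label(label):
    """Split a label such as 'C230' into (kind, position, novel_base)."""
    match = _LABEL.match(label)
    if match is None:
        raise ValueError(f"unrecognised mutation label: {label!r}")
    code, position = match.group(1), int(match.group(2))
    if code == "x":
        return ("deletion", position, None)
    if code == "d":
        return ("duplication", position, None)
    return ("substitution", position, code)
```

Parsed labels of this kind are a convenient starting point for counting how often the same
substitution appears on different branches of a proposed genealogy.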

PCR artefacts make little contribution to the database of mutations; not only is the
prevalence of nucleotide substitutions greatly in excess of that observed in control PCR
amplifications (<0.05 x 10^-3 bp^-1) but also identically mutated clones (as well as
dynastically related ones) are found in independent amplifications. In many cases,
generations within a lineage differ by a single nucleotide substitution indicating that only
a small number of substitutions have been introduced in each round of mutation.

Analysis of Vλ rearrangements reveals that Ramos harbours an in-frame rearrangement of
Vλ2.2-16 (as described by Chapman et al., 1996) and an out-of-frame rearrangement of
Vλ2.2-25. There is mutational diversity in both rearranged Vλs although greater diversity
has accumulated on the non-functional allele (Figure 1C).

A classic feature of antibody hypermutation is that mutations largely accumulate in the V
region but scarcely in the C. This is also evident in the mutations that have accumulated
in the Ramos IgH locus (Figure 1D). M13 clones containing cDNA inserts extending
through Vh, Cμ1 and the first 87 nucleotides of Cμ2 are generated by PCR from the initial
Ramos culture. The pie charts (presented as in Figure 1A) depict the extent of mutation
identified in the 341 nucleotide stretch of Vh as compared to a 380 nucleotide stretch of
Cμ extending from the beginning of Cμ1.

The IgM immunoglobulin produced by Ramos is present both on the surface of the cells
and, in secreted form, in the culture medium. Analysis of the culture medium reveals that
Ramos secretes immunoglobulin molecules to a very high concentration, approximately
1 μg/ml. Thus, Ramos is capable of secreting immunoglobulins to a level which renders it
unnecessary to reclone immunoglobulin genes into expression cell lines or bacteria for
production.

Example 2: Vh diversification in Ramos is constitutive

To address whether V gene diversification is ongoing, the cells are cloned and Vh
diversity assessed using a MutS-based assay after periods of in vitro culture. The Ramos
Vh is PCR amplified and purified as described above using oligonucleotides containing a
biotinylated base at the 5'-end. Following denaturation/renaturation (99°C for 3 min; 75°C
for 90 min), the extent of mutation is assessed by monitoring the binding of the
mismatched heteroduplexed material to the bacterial mismatch-repair protein MutS,
filter-bound, with detection by ECL as previously described (Jolly et al., 1997).

The results indicate that Vh diversification is indeed ongoing (see Figure 2A). DNA is
extracted from Ramos cells that have been cultured for 1 or 3 months following limit
dilution cloning. The rearranged Vh is PCR amplified using biotinylated
oligonucleotides prior to undergoing denaturation/renaturation; mismatched
heteroduplexes are then detected by binding to immobilised MutS as previously described
(Jolly et al., 1997). An aliquot of the renatured DNA is bound directly onto membranes to
confirm matched DNA loading (Total DNA control). Assays performed on the Ramos
Vh amplified from a bacterial plasmid template as well as from the initial Ramos culture
are included for comparison.

The Vh genes are PCR amplified from Ramos cultures that have been expanded for four
(Rc1) or six (Rc13 and 14) weeks (Figure 2B). A mutation rate for each clone is indicated
and is calculated by dividing the prevalence of independent Vh mutations at 4 or 6 weeks
post-cloning by the presumed number of cell divisions based on a generation time of 24 h.
The sequences reveal step-wise mutation accumulation with a mutation rate of about
0.24 x 10^-4 mutations bp^-1 generation^-1.
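The calculation described above (prevalence divided by the presumed number of cell
divisions) can be written out as a minimal Python sketch. The function names are
hypothetical, and the sketch assumes one cell division per generation time throughout the
culture period, as the text presumes for a 24 h generation time.

```python
def generations_elapsed(weeks, generation_time_h=24.0):
    """Presumed number of cell divisions in the given culture period."""
    return weeks * 7 * 24 / generation_time_h


def mutation_rate(prevalence_per_bp, weeks, generation_time_h=24.0):
    """Mutation rate in mutations per bp per generation.

    prevalence_per_bp is the prevalence of independent mutations
    (mutations per bp) measured after the given number of weeks.
    """
    return prevalence_per_bp / generations_elapsed(weeks, generation_time_h)
```

As a purely illustrative input, a prevalence of 1.0 x 10^-3 mutations bp^-1 after six weeks
(42 presumed generations) would correspond to a rate of roughly 0.24 x 10^-4 mutations
bp^-1 generation^-1.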

Direct comparison of the Vh mutation rate in Ramos to that in other cell-lines is not
straightforward since there is little information on mutation rates in other lines as judged
by unselected mutations incorporated throughout the Vh obtained following clonal
expansion from a single precursor cell. However, the prevalence of mutations following a
two week expansion of 50 precursor BL2 cells has been determined under conditions of
mutation induction (2.7 x 10^-3 mutations bp^-1; Denepoux et al., 1997). Similar
experiments performed with Ramos under conditions of normal culture reveal a mutation
prevalence of 2.3 x 10^-3 mutations bp^-1. Various attempts to enhance the mutation rate by
provision of cytokines, helper T cells etc. have proved unsuccessful. Thus, the rate of
mutation that can be achieved by specific induction in BL2 cells appears to be similar to
the constitutive rate of Vh mutation in Ramos.

Example 3: Examination of the nature of Vh mutations in Ramos 

A database of mutational events is created which combines those detected in the initial
Ramos culture (from 141 distinct sequences) with those detected in four subclones that
have been cultured in various experiments without specific selection (from a further 135
distinct sequences). This database is created after the individual sets of sequences have
been assembled into dynastic relationships (as detailed in the legend to Figure 1B) to
ensure that clonal expansion of an individual mutated cell does not lead to a specific
mutational event being counted multiple times. Here an analysis of this composite
database of 340 distinct and presumably unselected mutational events (200 contributed by
the initial Ramos culture and 140 from the expanded subclones) is described; separate
analysis of the initial and subclone populations yields identical conclusions.

The overwhelming majority of the mutations (333 out of 340) are single nucleotide
substitutions. A small number of deletions (4) and duplications (3) are observed but no
untemplated insertions; these events are further discussed below. There are only five
sequences which exhibit nucleotide substitutions in adjacent positions; however, in
three of these five cases, the genealogy reveals that the adjacent substitutions have been
sequentially incorporated. Thus, the simultaneous creation of nucleotide substitutions in
adjacent positions is a rare event.

The distribution of the mutations along the Vh is highly non-random (see Figure 3).
Independently occurring base substitutions are indicated at each nucleotide position. The
locations of CDR1 and 2 are indicated. Nucleotide positions are numbered from the
3'-end of the sequencing primer with nucleotide position +1 corresponding to the first base
of codon 7; codons are numbered according to Kabat. Mutations indicated in italics
(nucleotide positions 15, 193, 195 and 237) are substitutions that occur in a mutated
subclone and have reverted the sequence at that position to the indicated consensus.

The major hotspot is at the G and C nucleotides of the Ser82a codon, which has
previously been identified as a major intrinsic mutational hotspot in other Vh genes
(Wagner et al., 1995; Jolly et al., 1996) and conforms to the RGYW consensus (Rogozin
and Kolchanov, 1992; Betz et al., 1993). Whilst the dominant intrinsic mutational
hotspot in many Vh genes is at Ser31, this codon is not present in the Ramos consensus
Vh (or its germline counterpart) which have Gly at that position. The individual
nucleotide substitutions show a marked bias in favour of transitions (51% rather than
randomly-expected 33%). There is also a striking preference for targeting G and C which
account for 82% of the nucleotides targeted (Table 1).
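To make the RGYW consensus concrete, the following Python sketch scans a sequence
for the motif, reading R as a purine (A or G), Y as a pyrimidine (C or T) and W as A or T
in line with the standard IUPAC nucleotide codes. The function name is hypothetical,
and the sketch examines only the strand supplied; it is an illustration of the motif and not
an analysis performed in the examples.

```python
import re

# R = A/G, G, Y = C/T, W = A/T; the lookahead allows overlapping matches
_RGYW = re.compile(r"(?=([AG]G[CT][AT]))")


def find_rgyw(sequence):
    """Return (1-based start position, matched tetramer) for each RGYW motif
    found on the supplied strand."""
    return [(m.start() + 1, m.group(1)) for m in _RGYW.finditer(sequence.upper())]
```

Motif positions found in this way can be compared with the positions of observed
substitutions to see how many fall within candidate hotspots.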

Table 1. Nucleotide substitution preferences of hypermutation in Ramos

                     Frequency of substitution to
Parental
nucleotide        T        C        G        A       Total

    T             -       3.9      1.2      3.0       8.1
    C           17.4       -      12.6      4.8      34.8
    G            7.2     15.9       -      24.0      47.1
    A            2.4      1.8      5.7       -        9.9

Single nucleotide substitutions were computed on the Vh coding strand and are given as
the percentage of the total number (333) of independent, unselected nucleotide
substitutions identified.
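The transition bias and G/C targeting figures quoted in the text can be recomputed from
Table 1, as in the following Python sketch. The dictionary simply transcribes the table
(parental nucleotide first, then novel base); the variable and function names are
hypothetical.

```python
# Table 1 values: percentage of the 333 substitutions, keyed as
# TABLE1[parental nucleotide][novel base]
TABLE1 = {
    "T": {"C": 3.9, "G": 1.2, "A": 3.0},
    "C": {"T": 17.4, "G": 12.6, "A": 4.8},
    "G": {"T": 7.2, "C": 15.9, "A": 24.0},
    "A": {"T": 2.4, "C": 1.8, "G": 5.7},
}

# Transitions are purine<->purine (A<->G) or pyrimidine<->pyrimidine (C<->T)
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}


def transition_percentage(table):
    """Percentage of all substitutions that are transitions."""
    return sum(
        value
        for parent, row in table.items()
        for novel, value in row.items()
        if (parent, novel) in TRANSITIONS
    )


def gc_target_percentage(table):
    """Percentage of all substitutions whose parental nucleotide is G or C."""
    return sum(sum(table[parent].values()) for parent in ("G", "C"))
```

Rounded to one decimal place the two functions give 51.0 and 81.9, in agreement with the
51% transition bias and the 82% G/C targeting stated in the text.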




Example 4: Selection of hypermutating cells by IgM-loss 

Analysis of the Ramos variants reveals several mutations that must have inactivated Vh
(see Figure 1B) suggesting it might be possible for the cells to lose IgM expression but
remain viable. If this is the case, Ig expression loss would be an easy means to select a
constitutively hypermutating B cell line.

Analysis of the Ramos culture reveals it to contain 8% surface IgM-negative cells. Such
IgM-loss variants are generated during in vitro culture, as follows. The starting Ramos
culture is transfected with a pSV2neo plasmid, diluted into 96-well plates and clones
growing in selective medium allowed to expand. Flow cytometry performed on the
expanded clones six months after the original transfection reveals the presence of
IgM-loss variants, constituting 16% and 18% of the two clonal populations (Rc13 and
Rc14) shown here (Figure 4A). Enrichment by a single round of sorting yields
subpopulations that contain 87% (Rc13) and 76% (Rc14) surface IgM-negative cells.
Following PCR amplification of the rearranged Vh gene in these subpopulations,
sequencing reveals that 75% (Rc13) and 67% (Rc14) of the cloned Vh segments contain a
nonsense (stop), deletion (del) or duplication (dup) mutation within the 341 nucleotide
Vh stretch analysed. The remainder of the clones are designated wild type (wt) although
no attempt is made to discriminate possible VH-inactivating missense mutations. The 4
deletions and 3 duplications identified in the Rc13 population are all distinct whereas
only 4 distinct mutations account for the 7 Rc14 sequences determined that harbour
deletions.

The nature of the deletions and duplications is presented in Figure 6: each event is named
with a letter followed by a number. The letter gives the provenance of the mutation (A, B
and C being the cloned TdT-negative control transfectants, D, E and F the TdT-positive
transfectants and U signifies events identified in the initial, unselected Ramos culture); the
number indicates the first nucleotide position in the sequence string. Nucleotides deleted
are specified above the line and nucleotides added (duplications or non-templated
insertions) below the line; single nucleotide substitutions are encircled with the novel
base being specified. The duplicated segments of Vh origin are underlined; non-templated
insertions are in bold. With several deletions or duplications, the event is flanked by a
single nucleotide of unknown provenance. Such flanking changes could well arise by
nucleotide substitution (rather than non-templated insertion) and these events are
therefore grouped separately; the assignment of the single base substitution (encircled) to
one or other end of the deletion/duplication is often arbitrary.

The IgM⁻ cells are enriched in a single round of sorting prior to PCR amplification and
cloning of their VH segments. The sequences reveal a considerable range of
VH-inactivating mutations (stop codons or frameshifts) (Figure 4), although diverse
inactivating mutations are evident even in IgM-loss variants sorted after only 6 weeks of
clonal expansion (see Figure 5). In Figure 5A, expression of TdT in three pSV-pβG/TdT
and three control transfectants of Ramos is compared by Western blot analysis of nuclear
protein extracts. Nalm6 (a TdT-positive human pre-B cell lymphoma) and HMy2 (a
TdT-negative mature human B lymphoma) provided controls.

In Figure 5B, pie charts are shown depicting independent mutational events giving rise to
IgM-loss variants. IgM⁻ variants (constituting 1-5% of the population) are obtained by
sorting the three TdT⁺ and three TdT⁻ control transfectants that have been cultured for 6
weeks following cloning. The VH regions in the sorted subpopulations are PCR
amplified and sequenced. The pie charts depict the types of mutation giving rise to VH
inactivation, with the data obtained from the TdT⁺ and TdT⁻ IgM⁻ subpopulations
separately pooled. Abbreviations are as in Figure 4A except that "ins" indicates clones
containing apparently non-templated nucleotide insertions. Clones containing deletions
or duplications together with multiple nucleotide non-templated insertions are only
included within the "ins" segment of the pie. Only unambiguously distinct mutational
events are computed. Thus, of the 77 distinct VH-inactivating mutations identified in the
TdT⁺ IgM-loss subpopulations, 30 distinct stop codon mutations are identified; if the
same stop codon has been independently created within the IgM-loss population derived
from a single Ramos transfectant, this would have been under-scored.

The stop codons are created at a variety of positions (Figure 4B) but are not randomly
located. Figure 4B summarises the nature of the stop codons observed in the Rc13 and
Rc14 IgM-loss populations. At least eight independent mutational events yield the
nonsense mutations which account for 20 out of the 27 non-functional VH sequences in
the Rc13 database; a minimum of ten independent mutational events yield the nonsense
mutations which account for 15 of the 22 non-functional VH sequences in the Rc14
database. The numbers in parentheses after each stop codon give the number of
sequences in that database that carry the relevant stop codon followed by the number of
these sequences that are distinct, as discriminated on the basis of additional mutations.
Analysis of stop codons in IgM-loss variants selected from four other clonal populations
reveals stop codon creation at a further five locations within VH. In data obtained in six
independent experiments, stop codon creation is restricted to 16 of the 39 possible sites,
the DNA sequences at these preferred sites being biased (on either coding or non-coding
strand) towards the RGYW consensus.
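
The notion of a "possible site" for stop codon creation can be made concrete in code. The sketch below assumes that a possible site is any sense codon that a single nucleotide substitution can convert into TAA, TAG or TGA. The input fragment is hypothetical and is not the Ramos VH sequence; the function name is likewise illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def potential_stop_sites(coding_seq):
    """List (codon index, codon, reachable stops) for every in-frame sense codon
    that one nucleotide substitution can turn into a stop codon."""
    sites = []
    usable = len(coding_seq) - len(coding_seq) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = coding_seq[i:i + 3]
        if codon in STOP_CODONS:
            continue  # already a stop, so not a site of stop codon creation
        reachable = set()
        for pos in range(3):
            for base in "ACGT":
                if base != codon[pos]:
                    mutant = codon[:pos] + base + codon[pos + 1:]
                    if mutant in STOP_CODONS:
                        reachable.add(mutant)
        if reachable:
            sites.append((i // 3, codon, sorted(reachable)))
    return sites


# Hypothetical in-frame fragment used purely for illustration.
demo = "CAGTGGAAGGCTTAC"
for index, codon, stops in potential_stop_sites(demo):
    print(index, codon, stops)
```

Applied to a real VH coding sequence, the length of the returned list would give the number of possible sites against which the observed stop codon positions can be compared.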

Not surprisingly, whereas deletions and insertions account for only a small proportion of
the mutations in unselected Ramos cultures (see above), they make a much greater
contribution when attention is focused on VH-inactivating mutations. It is notable that a
large proportion of the IgM-loss variants can be accounted for by stop-codon/frameshift
mutations in the VH itself. This further supports the proposal that hypermutation in
Ramos is preferentially targeted to the immunoglobulin V domain, certainly rather than
the C domain or, indeed, other genes (such as the Igα/Igβ sheath) whose mutation could
lead to a surface IgM⁻ phenotype. It may well also be that the Ramos VH is more
frequently targeted for hypermutation than its productively rearranged Vλ, a conclusion
supported by the pattern of mutations in the initial culture (Figure 1C).

Selection of cells by detection of Ig-loss variants is particularly useful where those
variants are capable of reverting, i.e. of reacquiring their endogenous Ig-expressing ability.

The dynasty established earlier (Figure 1B) suggests not only that IgM-loss cells could
arise but also that they might undergo further mutation. To confirm this, IgM-loss
variants sorted from Rc13 are cloned by limiting dilution. Three weeks after cloning, the
IgM⁻ subclones are screened for the presence of IgM⁺ revertants by cytoplasmic
immunofluorescence analysis of 5x10^ cells; their prevalence is given (Figure 4C). These
IgM⁺ revertants are then enriched in a single round of sorting and the VH sequences of
the clonal IgM⁻ variant are compared with those of its IgM⁺ revertant descendants.

Cytoplasmic immunofluorescence of ten expanded clonal populations reveals the
presence of IgM⁺ revertants at varying prevalence (from 0.005% to 1.2%; Figure 4C),
allowing a mutation rate of 1x10^-4 mutations bp^-1 generation^-1 to be calculated by
fluctuation analysis. This is somewhat greater than the rate calculated by direct analysis
of unselected mutations (0.25x10^-4 mutations bp^-1 generation^-1; see above), probably in
part reflecting that different IgM-loss clones revert at different rates depending upon the
nature of the disrupting mutation. Indeed, the sequence surrounding the stop codons in
the IgM-loss derivatives of Rc13 reveals that TAG32 conforms well to the RGYW
consensus (R = purine, Y = pyrimidine and W = A or T; Rogozin and Kolchanov, 1992),
which accounts for a large proportion of intrinsic mutational hotspots (Betz et al., 1993),
whereas TAA33 and TGA36 do not (Figure 4D).
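
Conformity to the RGYW consensus can be checked mechanically on both strands. The following is a minimal sketch: it searches a coding-strand sequence for RGYW and for its reverse complement, WRCY, which is how an RGYW motif on the non-coding strand reads on the coding strand. The test sequence is hypothetical and the function name is an assumption made for illustration.

```python
import re

# R = A or G, Y = C or T, W = A or T.
# Lookahead patterns are used so that overlapping motifs are all reported.
RGYW = re.compile(r"(?=[AG]G[CT][AT])")
WRCY = re.compile(r"(?=[AT][AG]C[CT])")


def hotspot_positions(seq):
    """Return 0-based start positions of RGYW (coding strand) and of WRCY,
    the coding-strand reading of RGYW on the non-coding strand."""
    seq = seq.upper()
    return {
        "RGYW": [m.start() for m in RGYW.finditer(seq)],
        "WRCY": [m.start() for m in WRCY.finditer(seq)],
    }


# Hypothetical sequence for illustration only.
print(hotspot_positions("CCAGCTGGTACC"))
```

A stop codon such as TAG32 could then be scored as conforming if the mutated base lies within one of the reported four-nucleotide windows.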

Example 5: Selection of a novel Ig binding activity 

In experiments designed to demonstrate development of novel binding affinities, it is
noted that most members of the Ramos cell line described below express a membrane
IgM molecule which binds anti-idiotype antibodies (anti-Id1 and anti-Id2) specifically
raised against the Ramos surface IgM. However, a few cells retain a surface IgM, yet fail
to bind the anti-idiotype antibody. This is due to an alteration in binding affinity in the
surface IgM molecule, such that it no longer binds the antibody. Cells which express a
surface IgM yet cannot bind the antibody can be selected in a single round of cell sorting
according to the invention.

This is demonstrated by isolating μ-positive/id-negative clones which, as shown by
ELISA, have lost the capacity to bind to anti-Id2 despite the retention of a surface IgM.
The clones are sequenced and in six independent clones a conserved VH residue, K70, is
found to be mutated to N, M or R as follows:

Clone   Mutation   Codon change
2       K70N       AAG-AAC
        S77N       AGC-AAC
4       K70M       AAG-ATG
9       S59R       AGT-AGG
        K70N       AAG-AAC
10      K70N       AAG-AAC
12      K70N       AAG-AAC
13      K70R       AAG-AGG

No mutations were observed in the light chain. Thus, it is apparent that mutants may be 
selected from the Ramos cell line in which the Ig molecule produced has a single base- 
pair variation with respect to the parent clone. 
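
The codon changes listed for the id-negative clones can be checked for internal consistency. The sketch below confirms that each listed change differs from the parental codon at exactly one position and that the amino acids named in the mutation label match the codons, using only the subset of the standard genetic code needed here. The names are illustrative assumptions.

```python
# Subset of the standard genetic code covering the codons listed above.
CODON_TO_AA = {
    "AAG": "K", "AAC": "N", "ATG": "M",
    "AGG": "R", "AGC": "S", "AGT": "S",
}

# (mutation label, parental codon, mutant codon), one entry per distinct change.
OBSERVED = [
    ("K70N", "AAG", "AAC"),
    ("S77N", "AGC", "AAC"),
    ("K70M", "AAG", "ATG"),
    ("S59R", "AGT", "AGG"),
    ("K70R", "AAG", "AGG"),
]


def check(label, before, after):
    """Return (number of differing bases, whether label matches the codons)."""
    n_diff = sum(1 for a, b in zip(before, after) if a != b)
    consistent = CODON_TO_AA[before] == label[0] and CODON_TO_AA[after] == label[-1]
    return n_diff, consistent


for label, before, after in OBSERVED:
    print(label, before, after, check(label, before, after))
```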

Making use of an anti-Id1, a similar population of cells is isolated which retain expression
of the Igμ constant region but which have lost binding to the anti-idiotype antibody.
These cells are enriched by sorting cytometry and the sequence of VH is determined
(Figure 7). This reveals six mutations when compared with the consensus sequence of the
starting population. Two of these mutations result in amino acid sequence changes
around CDR3 (R->T at 95 and P->H at 98). Thus, more subtle changes in the
immunoglobulin molecule are selectable by assaying for loss of binding.

In further experiments, hypermutating cells according to the invention are washed,
resuspended in PBS/BSA (10^ cells in 0.25 ml) and mixed with an equal volume of
PBS/BSA containing 10% (v/v) antigen-coated magnetic beads. In the present
experiment, streptavidin-coated magnetic beads (Dynal) are used. After mixing at 4°C on
a roller for 30 mins, the beads are washed three times with PBS/BSA, each time bringing
down the beads with a magnet and removing unbound cells. The remaining cells are then
seeded onto 96-well plates and expanded up to 10^ cells before undergoing a further round
of selection. Multiple rounds of cell expansion (accompanied by constitutively-ongoing
hypermutation) and selection are performed. After multiple rounds of selection, the
proportion of cells which bind to the beads, which is initially at or close to background
levels of 0.02%, begins to rise.

After 4 rounds, enrichment of streptavidin-binding cells is seen. This is repeated on the
fifth round (Figure 8). The low percentage recovery reflects saturation of the beads with
cells, since changing the cell:bead ratio from vast excess to 1:2 allows a recovery of
approximately 20% from round five streptavidin-binding cells (Figure 9). This
demonstrates successful selection of a novel binding specificity from the hypermutating
Ramos cell line by four rounds of iterative selection.

Nucleotide sequencing of the heavy and light chains from the streptavidin-binding cells
predicts one amino acid change in VH CDR3 and four changes in VL (1 in FR1, 2 in
CDR1 and 1 in CDR2) when compared with the consensus sequence of the starting
population (Figure 11).

To ensure that the binding of streptavidin is dependent on expression of surface
immunoglobulin, immunoglobulin-negative variants of the streptavidin-binding cells are
enriched by sorting cytometry. This markedly reduces the recovery of streptavidin-binding
cells with an excess of beads. The cells recovered by the Dynal-streptavidin
beads from the sorted negative cells are in fact Igμ-positive and most likely represent
efficient recovery of Igμ-positive streptavidin-binding cells contaminating the
immunoglobulin-negative sorted cell population.

Preliminary data suggest that the efficiency of recovery is reduced as the concentration of
streptavidin on the beads is reduced (Figure 9). This is confirmed by assaying the
recovery of streptavidin-binding cells with beads incubated with a range of concentrations
of streptavidin (Figure 10). The percentage of cells recoverable from a binding
population is dictated by the ratio of beads to cells. In this experiment the ratio is < 1:1
beads:cells.

In a further series of experiments, a further two rounds of selection are completed, taking
the total to 7. This is accomplished by reducing the concentration of streptavidin bound
to the beads from 50 μg/ml in round 5 to 10 μg/ml in round 7. Although the secretion
levels of IgM are comparable for the populations selected in rounds 4 to 7 (Figure 12),
streptavidin binding as assessed by ELISA is clearly greatly increased in rounds 6 and 7
in comparison with round 4 (Figure 13).

This is confirmed by assessment of binding by Surface Plasmon Resonance on a BiaCore
chip coated with streptavidin (Figure 14). The supernatant from round 7 is injected to
flow across the chip at point A, and stopped at point B. At point C, anti-human IgM is
injected, to demonstrate that the material bound to the streptavidin is IgM. The gradient
A-B represents the association constant, and the gradient B-C the dissociation constant.
From the BiaCore trace it is evident that round 6 supernatant displays superior binding
characteristics to that isolated from round 4 populations or unselected Ramos cells.

Antibodies from round 6 of the selection process also show improved binding with
respect to round 4. Binding of cells from round 6 selections to streptavidin-FITC
aggregates, formed by preincubation of the fluorophore with a biotinylated protein, can be
visualised by FACS, as shown in Figure 15. Binding to round 4 populations, unselected
Ramos cells or IgM-negative Ramos is not seen, indicating maturation of streptavidin
binding.

Use of unaggregated streptavidin-FITC does not produce similar results, with the majority
of round 6 cells not binding. This, in agreement with ELISA data, suggests that binding
to streptavidin is due to avidity of the antibody binding to an array of antigen, rather than
to a monovalent affinity. Higher affinity binders may be isolated by sorting for binding to
non-aggregated streptavidin-FITC.

In order to determine the mutations responsible for the increased binding seen in round 6
cells over round 4 cells, the light and heavy chain antibody genes are amplified by PCR
and then sequenced. In comparison with round 4 cells, no changes in the heavy chain
genes are seen, with the mutation R103S being conserved. In the light chain, mutations
V23F and G24C are also conserved, but an additional mutation is present at position 46.
Wild-type Ramos has an aspartate at this position, whilst round 6 cells have an alanine.
Changes at this position are predicted to affect antigen binding, since residues in this
region contribute to CDR2 of the light chain (Figure 16). It seems likely that mutation
D46A is responsible for the observed increase in binding to streptavidin seen in round 6
cells.

Example 6: In Vitro Maturation of Ramos streptavidin binders
Ram B -> Ram C (selecting with FITC-Poly-Streptavidin)

Approximately 5 x 10^ Ram B cells (derived from the Ramos cell line to bind
streptavidin-coated microbeads) are washed with PBS and incubated on ice for 30 minutes
in 1 ml of PBS/BSA solution containing Poly-Streptavidin-FITC. (Poly-Streptavidin-FITC
is made by adding streptavidin-FITC (20 ng/ml protein content) to a biotinylated
protein (10 μg/ml) and incubating on ice for a few minutes prior to the addition of cells.)

The cells are then washed briefly in ice-cold PBS, spun down and resuspended in 500 μl
PBS.

The most fluorescent 1% of cells are sorted on a MoFlo cell sorter, and this population of
cells is returned to tissue culture medium, expanded to approximately 5 x 10^ cells and the
procedure repeated.
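
The gating step, in which the most fluorescent 1% of cells is retained, can be sketched as follows. This is an illustration only: a cell sorter applies its gate in real time, whereas the hypothetical function below takes a list of recorded intensities and returns the threshold and the retained events.

```python
def top_fraction_gate(intensities, fraction=0.01):
    """Return (threshold, kept) where kept holds the brightest `fraction` of events.

    At least one event is always kept when the input is non-empty.
    """
    if not intensities:
        return None, []
    n_keep = max(1, round(len(intensities) * fraction))
    ranked = sorted(intensities, reverse=True)
    threshold = ranked[n_keep - 1]  # dimmest event that still passes the gate
    return threshold, ranked[:n_keep]


# Hypothetical intensities in arbitrary units.
events = list(range(1, 1001))
threshold, kept = top_fraction_gate(events)
print(threshold, len(kept))
```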

After four rounds of sorting with Poly-Streptavidin-FITC, the cells are binding weakly to
Streptavidin-FITC. Sequencing of the expressed immunoglobulin V regions from this
Ramos cell population reveals that amino acid number 82a in framework three of the
heavy chain V region has changed from serine to arginine. This population of cells is
called Ram C.

Ram C-> Ram D (Selecting with FITC-Streptavidin) 

The next few rounds of cell sorting are done as described above but now using
streptavidin-FITC (20 μg/ml protein content).

After three rounds of sorting using Streptavidin-FITC, the sorted cell population (called
Ram D) is binding more strongly to Streptavidin-FITC as assayed by FACS. Sequencing of
the expressed V genes reveals a further amino acid change: in framework three, the amino
acid at position 65, originally a serine, has changed to arginine.

Ram D -> Ram E (Selecting with FITC-Streptavidin and unlabelled Streptavidin
competition)

A subsequent sorting is done as described above using Streptavidin-FITC. However, after
staining the cells on ice for 30 minutes, the cells are washed once in ice-cold PBS and
then resuspended in 0.5 mg/ml Streptavidin and incubated on ice for 20 minutes. This is
in order to compete against the already bound Streptavidin-FITC, such that only
Streptavidin-FITC that is strongly bound remains. The cells are then washed once in
ice-cold PBS and resuspended in 500 μl PBS prior to sorting the most fluorescent 1%
population as before.

After repeating this sorting protocol a further two times, the Ramos cell population (Ram
E) appears to bind quite strongly to Streptavidin-FITC. These cells have acquired another
amino acid change in framework one of the expressed heavy chain V gene: the amino acid
at position 10 has changed from glycine to arginine. Moreover, residue 18 has changed
from leucine to methionine.

The results of the streptavidin maturation in Ramos cells are shown in Figure 17. 

ELISA comparison 

An ELISA assay performed with the supernatants of the various Ramos cell populations
confirms that the IgM antibody expressed and secreted from Ramos cells has been
matured in vitro to acquire a strong affinity for streptavidin. The results are set forth in
Figure 18.

Example 7: Construction of transgene comprising hypermutation-directing 
sequences 

It is known that certain elements of Ig gene loci are necessary for direction of
hypermutation events in vivo. For example, the intron enhancer and matrix attachment
region Ei/MAR has been demonstrated to play a critical role (Betz et al., 1994).
Moreover, the 3' enhancer E3' is known to be important (Goyenechea et al., 1997).
However, we have shown that these elements, whilst necessary, are not sufficient to direct
hypermutation in a transgene.

In contrast, provision of Ei/MAR and E3' together with additional Jκ-Cκ intron DNA and
Cκ is sufficient to confer hypermutability. A βG-Cκ transgene is assembled by joining a
0.96 kb PCR-generated KpnI-SpeI β-globin fragment (that extends from -104 with
respect to the β-globin transcription start site to +863 and has artificial KpnI and SpeI
restriction sites at its ends) to a subfragment of LκΔ[3'Fl] [Betz et al., 1994] that extends
from nucleotide 2314 in the sequence of Max et al. [1981] through Ei/MAR, Cκ and E3',
and includes the 3'Fl deletion.

Hypermutation is assessed by sequencing segments of the transgene that are PCR
amplified using Pfu polymerase. The amplified region extends from immediately
upstream of the transcription start site to 300 nucleotides downstream of Jκ5.

This chimeric transgene is well targeted for mutation, with nucleotide substitutions
accumulating at a frequency similar to that found in a normal Igκ transgene. This
transgene is the smallest so far described that efficiently recruits hypermutation, and the
results indicate that multiple sequences located somewhere in the region including and
flanking Cκ combine to recruit hypermutation to the 5'-end of the β-globin/Igκ chimaera.

The recruitment of hypermutation can therefore be solely directed by sequences lying
towards the 3'-end of the hypermutation domain. However, the 5'-border of the mutation
domain in normal Ig genes lies in the vicinity of the promoter, some 100-200 nucleotides
downstream of the transcription start site. This positioning of the 5'-border of the
mutation domain with respect to the start site remains even in the βG-Cκ transgene, in
which the β-globin gene provides both the promoter and the bulk of the mutation domain.
These results are consistent with findings made with other transgenes indicating that it is
the position of the promoter itself that defines the 5'-border of the mutation domain.

The simplest explanation for the way in which some if not all the κ regulatory elements
contribute towards mutation recruitment is to propose that they work by bringing a
hypermutation priming factor onto the transcription initiation complex. By analogy with
the classic studies on enhancers as transcription regulatory elements, the Igκ enhancers
may work as regulators of hypermutation in a position- and orientation-independent
manner. Indeed, the data obtained with the βG-Cκ transgene together with previous
results in which E3' was moved closer to Cκ [Betz et al., 1994] reveal that the
hypermutation-enhancing activity of E3' is not especially sensitive to either its position
or its orientation with respect to the mutation domain.

Ei/MAR normally lies towards the 3'-end of the mutation domain. Whilst deletion of
Ei/MAR drastically reduces the efficacy of mutational targeting, its restoration to a
position upstream of the promoter (and therefore outside the transcribed region) gives a
partial rescue of mutation but without apparently affecting the position of the 5'-border of
the mutational domain. Independent confirmation of these results was obtained in
transgenic mice using a second transgene, tk-neo::Cκ, in which a neo transcription unit
(under control of the HSVtk promoter) is integrated into the Cκ exon by gene targeting in
embryonic stem cells [Zou et al., 1995]. In this mouse, following Vκ-Jκ joining, the Igκ
Ei/MAR is flanked on either side by transcription domains: the V gene upstream and
tk::neo downstream. The tk-neo gene is PCR amplified from sorted germinal centre B
cells of mice homozygous for the neo insertion.

For the tk-neo insert in tk-neo::Cκ mice, the amplified region extends from residues 607
to 1417 [as numbered in plasmid pMCNeo (GenBank accession U43611)], and the
nucleotide sequence is determined from position 629 to 1329. The mutation frequency of
endogenous VJκ rearrangements in tk-neo::Cκ mice is determined using a strategy
similar to that described in Meyer et al., 1996. Endogenous VJκ5 rearrangements are
amplified using a Vκ FR3 consensus forward primer
(GGACTGCAGTCAGGTTCAGTGGCAGTGGG) and an oligonucleotide LκFOR
[Gonzalez-Fernandez and Milstein, (1993) PNAS (USA) 90:9862-9866] that primes back
from downstream of the Jκ cluster.

Although the level of mutation of the tk-neo is low and it is certainly less efficiently
targeted for mutation than the 3'-flanking region of rearranged Vκ genes in the same cell
population, it appears that, as with normal V genes, the mutation domain in the neo gene
insert starts somewhat over 100 nucleotides downstream of the transcription start site
despite the fact that Ei/MAR is upstream of the promoter.

Thus, transgenes capable of directing hypermutation in a constitutively hypermutating cell
line may be constructed using Ei/MAR, E3' and regulatory elements as defined herein
found downstream of Jκ. Moreover, transgenes may be constructed by replacement of or
insertion into endogenous V genes, as in the case of the tk-neo::Cκ mice, or by linkage of
a desired coding sequence to the Jκ intron, as in the case of the βG-Cκ transgene.

Example 8: Selection of constitutively hypermutating cell line

As described above, a small proportion of V gene conversion events can lead to the
generation of a non-functional Ig gene, most frequently through the introduction of
frameshift mutations. Thus, the generation of sIgM-loss variants in the chicken bursal
lymphoma cell line, DT40, can be used to give an initial indication of IgV gene
conversion activity. Compared to the parental DT40 line, a mutant that lacks Rad54
shows a considerably diminished proportion of sIgM-loss variants (Fig. 19). A fluctuation
analysis performed on multiple clones reveals that the ΔRAD54 line generates sIgM-loss
variants at a frequency nearly tenfold less than that of parental DT40, whilst a ΔRAD52
line generates sIgM-loss variants at a similar frequency to wildtype cells (Fig. 19). These
observations are in keeping with earlier findings concerning gene conversion in ΔRAD54-
and ΔRAD52-DT40 cells (Bezzubova et al., 1997; Yamaguchi-Iwai et al., 1998).

This analysis is extended to DT40 cells lacking Xrcc2 and Xrcc3. These Rad51
paralogues have been proposed to play a role in the recombination-dependent pathway of
DNA damage repair (Liu et al., 1998; Johnson et al., 1999; Brenneman et al., 2000;
Takata et al., 2001). Rather than giving rise to a diminished abundance of sIgM-loss
variants, the ΔXRCC2 and ΔXRCC3 lines show a much greater accumulation of loss
variants than the parental line (Fig. 19). In the case of ΔXRCC2-DT40, transfection of the
human Xrcc2 cDNA under control of the human β-globin promoter causes the frequency
of generation of sIgM-loss variants to revert to close to wildtype values.

Figure 19 shows the generation of sIgM-loss variants by wildtype and repair-deficient
DT40 cells. Flow cytometric analyses of the heterogeneity of sIgM expression in cultures
derived by 1 month of clonal expansion of single sIgM⁺ normal (WT) or repair-deficient
(ΔRAD54, ΔRAD52, ΔXRCC2, ΔXRCC3) DT40 cells are shown in panel (a). An analysis of
cultures derived from three representative sIgM⁺ precursor clones is shown for each type
of repair-deficient DT40. The percentage of sIgM⁻ cells in each analysis is indicated, with
the fluorescence gate set as eightfold below the centre of the sIgM⁺ peak. Panel (b)
shows fluctuation analysis of the frequency of generation of sIgM-loss variants. The
abundance of sIgM-loss variants is determined in multiple parallel cultures derived from
sIgM⁺ single cells after 1 month of clonal expansion; median percentages are noted above
each data set and indicated by the dashed bar. The [pβG-hXRCC2]ΔXRCC2 transfectants
analysed are generated by transfection of pβG-hXRCC2 into sIgM⁺ DT40-ΔXRCC2
subclones that have 6.4% and 10.2% sIgM⁻ cells in the fluctuation analysis. The whole
analysis is performed on multiple, independent sIgM⁺ clones (with distinct, though similar,
ancestral Vλ sequences) giving, for each repair-deficient line, average median frequencies
at which sIgM-loss variants are generated after 1 month of WT (0.4%), ΔRAD54 (0.07%),
ΔRAD52 (0.4%), ΔXRCC2 (6%) and ΔXRCC3 (2%).

Since deficiency in both Xrcc2 and Xrcc3 is associated with chromosomal instability (Liu
et al., 1998; Cui et al., 1999; Deans et al., 2000; Griffin et al., 2000), it is possible that
the increased frequency of sIgM-loss variants could reflect gross rearrangements or
deletions within Ig loci. However, Southern blot analysis of 24 sIgM⁻ subclones of
ΔXRCC3-DT40 does not reveal any loss or alteration of the 6 kb SalI-BamHI fragment
containing the rearranged Vλ.

Therefore, to ascertain whether more localised mutations in the V gene could account for
the loss of sIgM expression, the rearranged Vλ segments in populations of sIgM⁻ cells that
are sorted from wildtype, ΔXRCC2- and ΔXRCC3-DT40 subclones after one month of
expansion are cloned and sequenced.

Cell culture, transfection and analysis

DT40 subclone CL18 and mutants thereof are propagated in RPMI 1640 supplemented
with 7% foetal calf serum, 3% chicken serum (Life Technologies), 50 μM
2-mercaptoethanol, penicillin and streptomycin at 37°C in 10% CO2. Cell density is
maintained at between 0.2 and 1.0x10^ ml^-1 by splitting the cultures daily. The generation
of the DT40 derivatives carrying targeted gene disruptions has been described elsewhere
(Bezzubova et al., 1997; Yamaguchi-Iwai et al., 1998; Takata et al., 1998, 2000, 2001).
Transfectants of ΔXRCC2-DT40 harbouring a pSV2-neo based plasmid that contains the
XRCC2 open reading frame (cloned from HeLa cDNA) under control of the β-globin
promoter are generated by electroporation.

CL18 is an sIgM⁻ subclone of DT40 and is the parental clone for the DNA repair mutants
described here. Multiple sIgM⁺ subclones are obtained from both wild type and
repair-deficient mutants using a Mo-Flo (Cytomation) sorter after staining with
FITC-conjugated goat anti-chicken IgM (Bethyl Laboratories). There is little variation in
the initial sequence expressed by all the sIgM⁺ DT40-CL18 derived repair-deficient cells
used in this work, since nearly all the sIgM⁺ derivatives have reverted the original CL18
frameshift by gene conversion using the ψV8 donor (which is most closely related to the
frameshifted CL18 CDR1).

Mutation analysis 

Genomic DNA is PCR amplified from 5000 cell equivalents using Pfu Turbo (Stratagene)
polymerase and a hotstart touchdown PCR [8 cycles @ 95°C 1', 68-60°C (at 1°C per
cycle) 1', 72°C 1'30"; 22 cycles @ 94°C 30", 60°C 1', 72°C 1'30"]. The rearranged Vλ
is amplified using CVLF6 (5'-CAGGAGGTGGCGGGGGCGTGAGTGATTGCCG;
priming in the leader-intron) and CVLR3
(5'-GGGCAAGCTTGGGGAGGGTGGCGGCAAGTGGAAG; priming back from 3' of Jλ);
the unrearranged Vλ1 using CVLF6 with CVLURRl
(5'-GGAATTGTCAGTGGGAGGAGGAGCAG); the rearranged VH gene using CVHIFI
(5'-GGGGAGCTCCGTCAGCGCTCTGTGTCC) with CJHIRI
(5'-GGGGTACCCGGAGGAGACGATGACTTCGG); and the Cλ region using CJCIRIF
(5'-GCAGTTGAAGAATTCCTCGCTGG; priming from within the Jλ-Cλ intron) with
GCMUCLAR (5'-GGAGCCATCGATCACGCAATCGAG; priming back from within
Cλ). After purification on QIAquick spin columns (Qiagen), PCR products are cut with
the appropriate restriction enzymes, cloned into pBluescriptSK and sequenced using the
T3 or T7 primers and an ABI377 sequencer (Applied Biosystems). Sequence alignment
(Bonfield et al., 1995) with GAP4 allows identification of changes from the consensus
sequence of each clone.
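
The touchdown cycling scheme can be written out as an explicit list of steps. The sketch below assumes the following reading of the cycling conditions: 8 touchdown cycles with 1 minute at 95°C, annealing for 1 minute starting at 68°C and falling by 1°C per cycle, and 1 minute 30 seconds at 72°C, followed by 22 cycles of 30 seconds at 94°C, 1 minute at 60°C and 1 minute 30 seconds at 72°C. The function name and data layout are illustrative assumptions.

```python
def touchdown_program(start_anneal=68, end_anneal=60, step=1, main_cycles=22):
    """Return a list of cycles; each cycle is three (temperature_C, seconds) steps:
    denaturation, annealing, extension."""
    program = []
    temp = start_anneal
    # Touchdown phase: annealing steps down from 68 to 61 degrees (8 cycles).
    while temp > end_anneal:
        program.append(((95, 60), (temp, 60), (72, 90)))
        temp -= step
    # Main phase: fixed annealing temperature reached at the end of the touchdown.
    for _ in range(main_cycles):
        program.append(((94, 30), (end_anneal, 60), (72, 90)))
    return program


program = touchdown_program()
print(len(program), [cycle[1][0] for cycle in program[:8]])
```

With the default arguments the touchdown phase contributes 8 cycles, in agreement with the cycle count stated for that phase, and the full program runs to 30 cycles.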

All sequence changes are assigned to one of three categories: gene conversion, point
mutation or an ambiguous category. This discrimination rests on the published sequences
of the pseudogenes that could act as donors for gene conversion. The database of
such donor sequences is taken from Reynaud et al. (1987) but implementing the
modifications (McCormack et al., 1993) pertaining to the Igλ G4 allele appropriate (Kim
et al., 1990) to the expressed Igλ in DT40. (The sequences/gene conversions identified in
this work support the validity of this ψVλ sequence database.) For each mutation the
database of Vλ pseudogenes is searched for potential donors. If no pseudogene donor
containing a string >9 bp can be found, then the mutation is categorised as an untemplated
point mutation. If such a string is identified and there are further mutations which could
be explained by the same donor, then all these mutations are assigned to a single gene
conversion event. If there are no further mutations, then the isolated mutation could have
arisen through a conversion mechanism or could have been untemplated and is therefore
categorised as ambiguous.
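
The three-way categorisation described above can be sketched in code. This is a simplified illustration, not the procedure actually used: it assumes that the clone and consensus sequences are of equal length (substitutions only), that a donor "explains" a mutation when any 10-nucleotide window of the clone sequence spanning the mutated position occurs verbatim in the donor, and that donors are searched without regard to alignment position. The sequences and donor names are hypothetical.

```python
MIN_MATCH = 10  # shortest window counted as a donor match ("a string >9 bp")


def donor_explains(mutant, pos, donor, min_len=MIN_MATCH):
    """True if some min_len window of `mutant` covering `pos` occurs in `donor`."""
    first = max(0, pos - min_len + 1)
    last = min(pos, len(mutant) - min_len)
    return any(mutant[s:s + min_len] in donor for s in range(first, last + 1))


def classify(consensus, mutant, donors):
    """Map each substituted position to 'point mutation', 'gene conversion' or 'ambiguous'."""
    positions = [i for i, (a, b) in enumerate(zip(consensus, mutant)) if a != b]
    explained_by = {
        pos: {name for name, seq in donors.items() if donor_explains(mutant, pos, seq)}
        for pos in positions
    }
    calls = {}
    for pos, names in explained_by.items():
        if not names:
            calls[pos] = "point mutation"          # no templating donor found
        elif any(names & other for p, other in explained_by.items() if p != pos):
            calls[pos] = "gene conversion"         # donor also explains another change
        else:
            calls[pos] = "ambiguous"               # isolated change with a possible donor
    return calls


# Hypothetical sequences for illustration only.
consensus = "ATGGCCTCAGTGAAGCTTCCGGATACTGCA"
clone = "ATGGCCTCTGTGCAGCTTCCGGATAGTGCA"
donors = {"psiV_a": "TTCCTCTGTGCAGCTTAA", "psiV_b": "AAGCCTCTGTGAAGTT"}
print(classify(consensus, clone, donors))
```

In this example the changes at positions 8 and 12 share the donor psiV_a and are grouped as one gene conversion event, while the change at position 25 has no donor and is scored as a point mutation.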

With regard to the Vλ sequences cloned from the sIgM⁻ subpopulations sorted from
multiple wildtype DT40 clones, 67% carry mutations: in the majority (73%) of cases,
these mutations render the Vλ obviously non-functional, as shown in Figure 20.
Presumably, most of the remaining sIgM⁻ cells carry inactivating mutations either in VH or
outside the sequenced region of Vλ.

Figure 20 shows analyses of sequences cloned from sIgM-loss variants. Panel (a)
compares Vλ sequences obtained from sIgM-loss cells that have been sorted from
parental sIgM⁺ clones of normal or Xrcc2-deficient DT40 cells after 1 month of clonal
expansion. Each horizontal line represents the rearranged Vλ1/Jλ (427 bp) with
mutations classified as described above as point mutations (lollipop), gene conversion
tracts (horizontal bar above line) or single nucleotide substitutions which could be a
result of point mutation or gene conversion (ambiguous, vertical bar). Hollow boxes
straddling the line depict deletions; triangles indicate duplications. Pie charts are shown
in panel (b), depicting the proportion of Vλ sequences that carry different numbers of
point mutations (PM), gene conversions (GC) or mutations of ambiguous origin (Amb)
amongst sorted sIgM-loss populations derived from wildtype, ΔXRCC2 or ΔXRCC3 DT40
sIgM⁺ clones after 1 month of clonal expansion. The sizes of the segments are
proportional to the number of sequences carrying the number of mutations indicated
around the periphery of the pie. The total number of Vλ sequences analysed is indicated
in the centre of each pie, with the data compiled from analysis of four subclones of
wildtype DT40, two of ΔXRCC2-DT40 and three of ΔXRCC3-DT40. Deletions,
duplications and insertions are excluded from this analysis; in wildtype cells, there are
additionally 6 deletions, 1 duplication and 1 insertion. There are no other events in
ΔXRCC2-DT40 and a single example each of a 1 bp deletion and a 1 bp insertion in the
ΔXRCC3-DT40 database.

Causes of Vλ gene inactivation in wildtype, ΔXRCC2 (ΔX2) and ΔXRCC3 (ΔX3) DT40
cells, expressed as a percentage of the total sequences that contained an identified
inactivating mutation, are set forth in panel (c): missense mutation (black); gene
conversion-associated frameshift (white); deletion-, insertion- or duplication-associated
frameshift (grey). Additional mutational events associated with each inactivating mutation
are then shown in (d). The data are expressed as the mean number of additional mutations
associated with each inactivating mutation with the type of additional mutation indicated
as in panel (c). Thus, ΔXRCC2-DT40 has a mean of 1.2 additional point mutations in
addition to the index inactivating mutation whereas wildtype DT40 has only 0.07.

As detailed above, the mutations may be classified as being attributable to gene
conversion templated by an upstream Vλ pseudogene, to non-templated point mutations
or as falling into an ambiguous category. Most (67%) of the inactivating mutations are
due to gene conversion although some (15%) are stop codons generated by non-templated
point mutations, demonstrating that the low frequency of point mutations seen here and
elsewhere (Buerstedde et al., 1985; Kim et al., 1990) in DT40 cells is not a PCR artefact
but rather reveals that a low frequency of point mutation does indeed accompany gene
conversion in wildtype DT40.

A strikingly different pattern of mutation is seen in the Vλ sequences of the sIgM-loss
variants from ΔXRCC2-DT40. Nearly all the sequences carry point mutations, typically
with multiple point mutations per sequence. A substantial shift towards point mutations
is also seen in the sequences from the sIgM^- ΔXRCC3-DT40 cells. Thus, whereas a
Vλ-inactivating mutation in wildtype DT40 is most likely to reflect an out-of-frame gene
conversion tract, in ΔXRCC2/3 it is likely to be a missense mutation (Fig. 20c).
Furthermore, whereas most of the nonfunctional Vλ sequences obtained from sorted
sIgM-loss variants of ΔXRCC2-DT40 (53%) or ΔXRCC3-DT40 (64%) carry additional
point mutations in addition to the Vλ-inactivating mutation, such hitchhiking is only
rarely observed in the nonfunctional Vλ sequences from the parental DT40 line (7%; Fig.
20d).

All these observations suggest that the high prevalence of sIgM-loss variants in
ΔXRCC2/3-DT40 cells simply reflects a very high frequency of spontaneous IgV gene
hypermutation in these cells. Figure 21 represents analyses of Ig sequences cloned from
unsorted DT40 populations after one month of clonal expansion. The sequences
obtained from representative wildtype and ΔXRCC2 DT40 clones are presented in panel
(a) with symbols as in Fig. 20. In panel (b), pie charts are shown depicting the proportion
of the Vλ sequences carrying different numbers of the various types of mutation as
indicated. The data are pooled from analysis of independent clones: wildtype (two
clones), ΔXRCC2 (four clones) and ΔXRCC3 (two clones). In addition to the mutations
shown, one ΔXRCC2-DT40 sequence contained a 2 bp insertion in the leader intron which
was not obviously templated from a donor pseudogene and one ΔXRCC3-DT40 sequence
carried a single base pair deletion also in the leader intron.

Mutation at other loci of ΔXRCC2-DT40 is shown in panel (c). Pie charts depict the
proportion of sequences derived from 1 month-expanded ΔXRCC2-DT40 cells that carry
mutations in the rearranged VH (272 bp extending from CDR1 to the end of JH) of the
rearranged heavy chain, in the unrearranged Vλ1 on the excluded allele (458 bp) and in
the vicinity of Cλ (425 bp extending from the Jλ-Cλ intron into the first 132 bp of Cλ).
Analysis of known VH pseudogene sequences (Reynaud et al., 1989) does not indicate
that any of the mutations observed in the rearranged VH are due to gene conversion,
strongly suggesting that they are due to point mutation although this assignment cannot be
regarded as wholly definitive. The mutation prevalences in these data sets are: 1.6 x 10^-3
mutations bp^-1 for VH, 0.03 x 10^-3 for the unrearranged Vλ1 and 0.13 x 10^-3 for Cλ as
compared to 2.0 x 10^-3 for point mutations in the rearranged Vλ1 in ΔXRCC2-DT40,
0.13 x 10^-3 for point mutations in rearranged Vλ1 in wildtype DT40 and 0.04 x 10^-3 for
background PCR error.

The distribution of point mutations across Vλ1 is shown in panel (d). The ΔXRCC2-DT40
consensus is indicated in upper case with the first base corresponding to the 76th
base pair of the leader intron. Variations found in the ΔXRCC3-DT40 consensus are
indicated in italic capitals below. The mutations are shown in lower case letters above the
consensus with those from ΔXRCC2-DT40 in black and those from ΔXRCC3-DT40 in
mid-grey. All mutations falling into the point mutation and ambiguous categories are
included. Correction has been made for clonal expansion as described previously (Takata
et al., 1998) so each lower case letter represents an independent mutational event. The
majority of the 27 mutations thereby removed from the original database of 158 are at one
of the seven major hotspots; the correction for clonality will, if it gives rise to any
distortion, lead to an underestimate of hotspot dominance. Of the seven major hotspots
(identified by an accumulation of > 5 mutations), five conform to the AGY consensus
sequence on one of the two strands as indicated with black boxes. Nucleotide
substitution preferences (given as a percentage of the database of 131 independent events)
as shown in panel (e) are deduced from the point mutations in sequences from unselected
ΔXRCC2- and ΔXRCC3-DT40. A similar pattern of preferences is evident if the
ΔXRCC2/ΔXRCC3 databases are analysed individually.

The spontaneous Vλ mutation frequency in wildtype and ΔXRCC2/3-DT40 cells is
analysed by PCR amplifying the rearranged Vλ segments from total (unsorted) DT40
populations that have been expanded for 1 month following subcloning. The result
reveals that there is indeed a much higher spontaneous accumulation of mutations in the
ΔXRCC2 and ΔXRCC3 cells than in the parental DT40 (Fig. 21a, b). In ΔXRCC2-DT40
cells, mutations accumulate in Vλ at a rate of about 0.4 x 10^-4 bp^-1 generation^-1 (given an
approximately 12 hour division time), a value similar to that seen in the constitutively
mutating human Burkitt lymphoma line Ramos.
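
The conversion from an accumulated mutation prevalence to a per-generation rate can be illustrated as follows. This Python sketch is an assumption-laden back-of-envelope check rather than the calculation actually performed: it takes the one-month expansion as 30 days, reads the prevalence of point mutations in the rearranged Vλ1 of ΔXRCC2-DT40 as about 2.0 x 10^-3 per bp, and uses a hypothetical helper name.

```python
def per_generation_rate(prevalence_per_bp, expansion_days, division_time_hours):
    """Convert an accumulated mutation prevalence into a per-generation rate."""
    generations = expansion_days * 24.0 / division_time_hours
    return prevalence_per_bp / generations


# Assumed inputs: ~2.0e-3 mutations per bp after ~30 days of clonal expansion,
# with a 12 hour division time (i.e. about 60 generations).
rate = per_generation_rate(2.0e-3, expansion_days=30, division_time_hours=12)
print(f"{rate:.2e} mutations per bp per generation")
```

Under these assumptions the estimate is of the order of 10^-5 to 10^-4 per bp per generation, the same order of magnitude as the figure quoted above; the exact value depends on the length of the expansion and on which mutation categories are counted.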

Somatic hypermutation in germinal centre B cells in man and mouse is preferentially
targeted to the rearranged immunoglobulin VH and VL segments. A similar situation
applies to the point mutations in ΔXRCC2-DT40 cells. Thus, a significant level of
apparent point mutation is also seen in the productively rearranged VH1 gene (Fig. 21c).
However, this does not reflect a general mutator phenotype since mutation accumulation
is much lower in Cλ than in the rearranged Vλ and is also low in the unrearranged Vλ on
the excluded allele where the apparent mutation rate does not rise above the background
level ascribable to the PCR amplification itself (Fig. 21c).

The distribution of the mutations over the Vλ domain in ΔXRCC2-DT40 cells is
strikingly non-random. The mutations, which are predominantly single nucleotide
substitutions, show preferential accumulation at hotspots that conform to an AGY
(Y = pyrimidine) consensus on one of the two DNA strands (Fig. 21d). They also occur
overwhelmingly (96%) at G/C. This G/C-biased, hotspot-focused hypermutation in
ΔXRCC2-DT40 cells, although exhibiting somewhat less of a bias in favour of nucleotide
transitions, is strikingly similar to the pattern of V gene hypermutation described in
cultured human Burkitt lymphoma cells as well as that occurring in vivo in frog, shark
and Msh2-deficient mice (Rada et al., 1998; Diaz et al., 2001). The IgV gene
hypermutation that occurs in vivo in man and normal mice appears, as previously
discussed, to be achieved by this hotspot-focused G/C biased component acting in concert
with a mechanism that targets A/T (Fig. 21e).
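
What it means for a position to conform to the AGY consensus "on one of the two DNA strands" can be made concrete with a short sketch. The Python fragment below is illustrative only and the function name is hypothetical: it scans the top strand for AGY (AGC or AGT) and for RCT (GCT or ACT), the reverse complement of AGY, and reports the 0-based position of the central G or C of each motif.

```python
import re


def agy_hotspot_positions(seq):
    """Return 0-based positions of the central G/C of AGY motifs on either strand.

    AGY on the top strand is matched directly; AGY on the bottom strand appears
    on the top strand as its reverse complement RCT (R = A or G).
    """
    seq = seq.upper()
    positions = set()
    # Lookahead patterns are used so that overlapping motifs are all reported.
    for match in re.finditer(r"(?=AG[CT])", seq):
        positions.add(match.start() + 1)   # the G of AGY, top strand
    for match in re.finditer(r"(?=[AG]CT)", seq):
        positions.add(match.start() + 1)   # the C opposite the G of AGY, bottom strand
    return positions
```

Comparing such a set of positions with the positions of observed substitutions gives a simple tally of how many mutations fall at consensus hotspots, although the hotspot boxes in the figure were presumably assigned by inspection.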

Thus, whereas the DT40 chicken bursal lymphoma line normally exhibits a low frequency
of IgV diversification by gene conversion, a high frequency of constitutive IgV gene
somatic mutation (similar in nature to that occurring in human B cell lymphoma models)
can be elicited by ablating Xrcc2 or Xrcc3. This provides strong support to the earlier
proposal that IgV gene conversion and hypermutation might constitute different ways of
resolving a common DNA lesion (Maizels et al., 1995; Weill et al., 1996). Recent data
suggest that the initiating lesion could well be a double strand break (Sale & Neuberger,
1998; Papavasiliou et al., 2000; Bross et al., 2000) and it would therefore appear
significant that both Xrcc2 and Xrcc3 have been implicated in a recombination-dependent
pathway of DNA break repair (Liu et al., 1998; Johnson et al., 1999; Pierce et al., 1999;
Brenneman et al., 2000; Takata et al., 2001). Indeed, a similar induction of IgV gene
hypermutation in DT40 cells is achieved by ablating another gene (RAD51B) whose
product is implicated in recombination-dependent repair of breaks (Takata et al., 2000)
but not by ablating genes for Ku70 and DNA-PKcs which are involved in non-homologous
end-joining. Figure 22 shows the analysis of sIgM-loss variants in DT40 cells deficient in
DNA-PK, Ku70 and Rad51B. Fluctuation analysis of the frequency of generation of
sIgM-loss variants after 1 month of clonal expansion is shown in panel (a). The median
values obtained with wildtype and ΔXRCC2 DT40 are included for comparison. Pie
charts depicting the proportion of Vλ sequences amplified from the sIgM-loss variants
derived from two sIgM^+ Rad51B-deficient DT40 clones that carry various types of
mutation as indicated are shown in panel (b). In addition, one sequence carried a 9 bp
deletion, one carried a 4 bp duplication and one carried a single base pair insertion.

The results, however, do not simply suggest that, in the absence of Xrcc2, a lesion which
would normally be resolved by gene conversion is instead resolved by a process leading to
somatic hypermutation. First, ΔXRCC2-DT40 cells retain the ability to perform IgV gene
conversion, albeit at a somewhat reduced level (Fig. 21b). Second, the frequency of
hypermutation in ΔXRCC2-DT40 cells is about an order of magnitude greater than the
frequency of gene conversion in the parental DT40 line. It is therefore likely that, in
normal DT40 cells, only a minor proportion of the lesions in the IgV gene are subjected to
templated repair from an upstream pseudogene thereby leading to the gene conversion
events observed. We believe that the major proportion of the lesions are subjected to a
recombinational repair using the identical V gene located on the sister chromatid as
template and which is therefore 'invisible'. This would be consistent with the
observations of Papavasiliou and Schatz (2000) who found that detectable IgV gene
breaks in hypermutating mammalian B cells are restricted to the G2/S phase. In the
absence of Xrcc2, Xrcc3 or Rad51B, we propose that the 'invisible' sister chromatid-
dependent recombinational repair is perverted, resulting in hypermutation. Whether this
hypermutation reflects that the sister chromatid-dependent recombinational repair
becomes error-prone in the absence of Xrcc2/3 or whether it reflects an inhibition of such
repair thereby revealing an alternate, non-templated mechanism of break resolution is an
issue that needs to be addressed. This question is not only important for an understanding
of the mechanism of hypermutation but may also provide insight into the physiological
function of the Rad51 paralogues.

Example 9: Affinity maturation in Δxrcc2 DT40 IgM

A population of Xrcc2-deficient DT40 cells which had been expanded for several months
was used to determine whether the action of hypermutation on the unique
VHDJH/VλJλ rearrangement in these cells could generate sufficient functional diversity to
allow the evolution of maturing lineages of antibodies to a significant proportion of
antigens tested. We used two methods for selection. In a first approach, cells were
incubated with soluble aggregates formed by mixing FITC-streptavidin with different
biotinylated antigens (casein, insulin, ovalbumin (Ova), thyroglobulin (Tg) and a rat
monoclonal antibody (Ab)); binding variants were then enriched by flow cytometry.
Alternatively, variants were selected using magnetic beads coated with S. aureus Protein
A or human serum albumin (HSA). None of the antigens tested showed detectable
binding to the parental DT40 or its secreted IgM. After six sequential rounds of selection,
there were essentially three types of outcome. In the case of insulin and casein, there was
no evidence for enrichment of binding variants. In the cases of the rat monoclonal
antibody and Protein A, specific binders were obtained. And in the cases of HSA, Ova
and Tg, binders were obtained but these exhibited varying degrees of polyreactivity. Thus,
with the selection performed using an aggregate of biotinylated rat IgG mAb S7 with
FITC-streptavidin (Fig. 23), specific binding was already evident by round three.
Subsequent enrichments were performed using the rat IgG S7 mAb directly conjugated to
phycoerythrin (PE). The cells obtained in round six (DT-Ab6) were specific for the rat
IgG S7 mAb as judged by the lack of staining by a variety of other reagents. ELISA of the
culture supernatant demonstrated that the binding of S7 mAb by DT-Ab6 cells was
conferred by the DT-Ab6 IgM itself (Fig. 23b). This DT-Ab6 IgM is in fact an anti-S7
idiotype. It recognises purified S7 mAb (and can be used for staining permeabilised S7
hybridoma cells) but this interaction is not competable by other rat immunoglobulins (Fig.
23c, d). DT-Ab6 cells could be stained using high dilutions of PE-S7 mAb conjugate,
suggesting a high affinity of interaction. By performing the staining at a fixed ratio of
molecules PE-S7 mAb: DT-Ab6 cells but varying the volume, an affinity in the range of
5.8 nM was deduced (Fig. 23f).

Sequence analysis revealed that DT-Ab6 VH/Vλ carry a total of 19 amino acid substitutions
compared to the parental DT40 sequence (Fig. 23g).

In the case of the bead selections performed using streptavidin beads coated with
biotinylated Protein A, binding variants were evident by round two (DT-P2; Fig. 24a).
However, many of the cells in the DT-P2 population were sticky in that they also bound to
various other types of bead that displayed neither streptavidin nor Protein A. Further
enrichments using tosylated beads to which Protein A had been directly conjugated
yielded a population of cells (DT-P4) that could be stained with a FITC-conjugated
Protein A aggregate (Fig. 24b). Serial selection for binding to this aggregate by use of
flow cytometry gave rise to a population (DT-P9) which could be stained weakly with
unaggregated FITC-Protein A; further sorting with FITC-Protein A then gave rise to more
brightly staining descendants (Fig. 24b). The IgM secreted by DT-P14 cells (as well as by
several of its precursors) bound well to Protein A as judged by both ELISA and
immunoprecipitation (Fig. 24c, d). Direct binding assays using radiolabelled Protein A
indicated a substantial increase in affinity had occurred between DT-P9 and DT-P14 (Fig.
24e), consistent with the flow cytometric analysis (Fig. 24b). This increase, which is
entirely due to an Ala->Val substitution adjacent to VH CDR2 (Fig. 24h), yields an IgM
with an apparent affinity for Protein A of about 0.32 nM (Fig. 24f). Interestingly, the
interaction between DT-P IgM and Protein A differs from the well characterised
interaction between Protein A and the Fc portion of rabbit IgG not just in the fact that the
DT-P/Protein A interaction is with the V rather than C portion of the IgM molecule
(mutations are found in VH not in Cμ) but also in the fact that it is likely that different sites
on Protein A are used to interact with DT-P IgM and rabbit IgG since high concentrations
of rabbit IgG do not inhibit the staining of DT-P4/P9 cells by FITC-Protein A (Fig. 24g).
Indeed, an enhancement of staining is seen, presumably due to aggregation of Protein A
by the rabbit IgG.

The sequence of the DT-P14 VH/Vλ reveals 17 amino acid substitutions compared to the
parental DT40 sequence. Five of these mutations are shared with the rat idiotype-specific
variant DT-Ab6. However, at least four of the five mutations must have occurred
independently in the two dynasties (rather than reflecting descent from a common mutated
precursor in the pool of Xrcc2-deficient DT40 cells) since these four mutations common
to DT-P14 and DT-Ab6 are not found in DT-P4. This tendency to repeat substitutions
might in part reflect that mutation in DT40 is largely restricted to the hotspot-focussed
GC-biased first phase of mutations (lacking some of the breadth of mutation that we have
ascribed to the A/T-biased second phase; Sale and Neuberger, 1998; Rada et al., 1998)
although it is also possible that they confer an advantage by predisposing the antibody
structure to maturability. It is notable that whilst Xrcc2-deficient DT40 cells retain the
ability to perform IgV gene conversion (albeit at much lower frequency than somatic
hypermutation), comparison of the V gene sequences in DT-P14 and DT-Ab6 cells (Fig.
24h) with those of the germline V segments (Reynaud et al., 1987; 1989) reveals that the
changes are largely (if not exclusively) due to point mutations rather than gene
conversions. The cells obtained after six rounds of selection using HSA-derivatised
carboxylated magnetic beads (DT-H6) stained not only with FITC-HSA, but also with
FITC-Tg, FITC-Ova and Cychrome-conjugated streptavidin despite the fact that the
DT-H6 cell population had never been exposed to these other antigens; further selections with
HSA simply increased the brightness of polyspecific staining (Fig. 25a). A similar, though
distinct, pattern of polyspecificity was evident in the cell population that had been
subjected to flow cytometric enrichment using complexes of FITC-streptavidin with either
biotinylated Tg or biotinylated Ova (Fig. 25a, d). Analysis of subpopulations revealed that
the apparent polyspecificity is not a reflection of cellular heterogeneity. The polyspecific
staining was mediated by the surface IgM itself since sIgM-loss variants lose
antigen-binding activity. Furthermore, antigen binding is readily detected by ELISA of the culture
supernatants (Fig. 25b, c).

Polyreactive antibodies are well described in man and mouse (both in serum and amongst
hybridomas), where the issue has been raised as to whether they constitute a good starting
point for the evolution of monospecificity (Casali and Schettino, 1996; Bouvet and
Dighiero, 1998). The same issue arises in the in vitro selection system (though without the
constraint of avoiding autoimmunity). We therefore tested whether it is possible to evolve
the DT-O6 population towards increased specificity for Ova by flow cytometric
enrichment for FITC-Ova^bright/Cychrome-streptavidin^dull cells. After multiple rounds of
sorting, cells displaying greater specificity for FITC-Ova were obtained (Fig. 25d). It will
be interesting to ascertain whether a polyspecific binding population provides a better
starting point for the evolution of specificity than the parental DT40 population.

The results presented here clearly demonstrate that hypermutating cell lines can be used
for both the derivation and iterative maturation of antibodies in vitro. Given the genetic
tractability of DT40, it is possible to extend the application to transfected IgV genes and
thereby mature the affinity of existing antibodies. Furthermore, since the hypermutation
mechanism can target heterologous genes put in place of the rearranged IgV segment
(Yelamos et al., 1995), it may well prove possible to extend the strategy to the maturation
of other ligand/receptor pairs. With regard to the de novo selection of antibodies, it is
striking that the action of somatic hypermutation on a single VH/Vλ rearrangement has
generated a repertoire from which it has been possible to select and mature high affinity
binders to two of the seven antigens tested as well as obtain signs of initial low affinity
(but maturable) binding to three of the others. This reflects the fact that very low antigen
affinities suffice to initiate the selection: these binding sites are then maturable. Clearly, if
wishing to extrapolate the approach to allow the in vitro production of high-affinity
human monoclonal antibodies, it would be advantageous to exploit the genetic tractability
of DT40 so as to generate a primary repertoire that includes more than a single VH/Vλ
rearrangement. The results obtained here indicate that this primary repertoire of
rearrangements would be orders of magnitude smaller than the primary repertoire used in
vivo.

Example 10: Isolation of naturally-occurring constitutively hypermutating EBV-positive
BL cell lines

A survey of naturally occurring EBV^+ BL cell lines revealed an absence of a clearly
identifiable population of sIgM-loss variants amongst many of them (e.g. Akata, BL74,
Chep, Daudi, Raji, and Wan). However, a clear sIgM^low population was noted in two of
these EBV^+ cell lines, ELI-BL and BL16, suggesting an intrinsic hypermutation capacity.
sIgM expression profiles of Ramos, EHRB, ELI-BL, and BL16 are shown in Fig. 26a. The
sIgM^low cell population is boxed and the percentage of cells therein indicated. Each dot
represents one cell. Note that the sizable sIgM^low population in BL16 is in part due to
less intensely staining positive cells, which also precluded fluctuation analyses. ELI-BL
harbors a type 2 EBV, resembles germinal center B cells, and expresses a latency gene
repertoire consisting only of EBNA1 and the non-coding EBER and Bam A RNAs
(Rowe et al., 1987). BL16 also contains a type 2 virus but, in contrast to ELI-BL, it
appears more LCL-like and expresses a full latency gene repertoire (Rooney et al., 1984;
Rowe et al., 1987).

Although a clear sIgM^low population was visible in ELI-BL and BL16 cultures, it
was important to address whether these variants could be attributed to bona fide
hypermutation. This was assessed by fluctuation analysis. In brief, (sub)clones were
transferred to 24 or 48 well plates, maintained with fresh medium for 3 to 8 weeks, and
analyzed by washing cells (1-2.5 x 10^) twice in PBS/3% FBS, staining (30 min on ice)
with the relevant antibody or antibody combination (below) and again washing prior to
analysis of at least 10^4 cells by flow cytometry (FACSCalibur, Becton Dickinson).
Antibodies used were R-phycoerythrin-conjugated, goat anti-human IgM (μ-chain
specific; Sigma), fluorescein isothiocyanate (FITC)-conjugated, mouse monoclonal
anti-Ramos idiotype [ZL16/1 (Zhang et al., 1995); provided generously by M. Cragg and M. J.
Glennie, Tenovus Research Laboratory, Southampton], and FITC-conjugated, goat
anti-mouse IgM (Southern Biotechnology Associates, Inc.). Data were acquired and analyzed
using CellQuest software (Becton Dickinson).

Unless noted otherwise, cells compared in fluctuation analyses were derived,
cultured, and analyzed in parallel. The median (as opposed to the mean) percentage of
sIgM-loss variants amongst a number of identically-derived (sub)clones is used as an
indicator of a cell's somatic hypermutation capacity to minimize the effects of early
mutational events. Fluctuation analysis of ELI-BL subclones revealed that the sIgM^low
variants were indeed being generated at high frequency during in vitro culture (Fig. 26b;
each cross represents the percentage of cells falling within the sIgM^low window following
a 1 month outgrowth of a single subclone; the median percentages are indicated), and VH
sequence analysis, in the case of BL16 subclones, confirmed that this instability reflected
somatic hypermutation (Fig. 26c). Base substitution mutations are indicated in lower case
letters above the 338 bp consensus DNA sequence in triplets of capital letters.
Complementarity-determining regions and partial PCR primer sequences are underlined
and emboldened, respectively. The corresponding amino acid sequence is indicated by
single capital letters. This consensus sequence differs at two positions from GenBank
entry gi.2253343 [TCA (Ser20) - TCT and AGC (Ser55) - ACC (Thr)].
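
The reason for preferring the median to the mean in the fluctuation analysis described above can be shown with a small numerical sketch. The Python fragment below uses purely hypothetical percentages (they are not data from this work) in which one subclone carries a "jackpot" of variants arising from an early mutational event; the function name is also hypothetical.

```python
from statistics import mean, median


def summarise_fluctuation(percent_loss_by_subclone):
    """Return the median and mean percentage of sIgM-loss variants."""
    values = list(percent_loss_by_subclone)
    return {"median": median(values), "mean": mean(values)}


# Hypothetical percentages of sIgM-loss cells in five parallel subclones;
# the last value mimics a subclone in which a mutation arose very early.
example = [0.4, 0.5, 0.6, 0.7, 12.0]
summary = summarise_fluctuation(example)
print(summary)
```

In this made-up example the single outlier pulls the mean well above the values seen in the other four subclones, whereas the median remains representative of the typical subclone.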

Considerable VH sequence diversity, including several sequences with multiple
base substitution mutations, and an overall high VH mutation frequency indicated that
hypermutation is ongoing in BL16. Moreover, despite the relatively small number of VH
sequences sampled, one dynastic relationship could be inferred [1st mutation at Gly54
(GGT - GAT); 2nd mutation at Val92 (GTG - ATG)]. Finally, like Ramos, most of the
BL16 VH base substitution mutations occurred at G or C nucleotides (24/33 or 73%) and
clustered within the complementarity determining regions (underlined in Fig. 26c). Thus,
several hallmarks of ongoing hypermutation were also distinguishable in two natural
EBV^+ BL cell lines, one expressing a limited latency gene repertoire and the other
expressing a full combination. It was therefore clear that somatic hypermutation can
proceed unabated even in the presence of EBV.

1. A method for preparing a cell line capable of directed constitutive hypermutation
of a target nucleic acid region, comprising screening a cell population for ongoing target
sequence diversification, and selecting a cell in which the rate of target nucleic acid
mutation exceeds that of other nucleic acid mutation by a factor of 100 or more.

2. A method according to claim 1, wherein the cell line is a lymphoid cell line.

3. A method according to claim 2, wherein the cell line is derived from an
immunoglobulin-expressing cell.

4. A method according to any preceding claim, wherein the cell line expresses the
target nucleic acid region in a manner that facilitates selection of cells comprising mutants
of said region.

5. A method according to claim 4, wherein the cell line expresses the gene product
encoded by the target nucleic acid region on the cell surface.

6. A method according to any preceding claim, wherein the cell line is derived from
or related to a cell type which hypermutates in vivo.

7. A method according to claim 6, wherein the cell line is a Burkitt lymphoma,
follicular lymphoma or diffuse large cell lymphoma cell line.

8. A method according to any preceding claim, further comprising the steps of
isolating one or more cells which display target sequence diversification, and comparing
the rate of accumulation of mutations in the target sequences with that in non-target
sequences in the isolated cells.

9. A method according to any preceding claim, wherein the target sequence is an
immunoglobulin V-gene sequence.

10. A method according to claim 9, wherein the cells are screened by assessing loss of
an expressed immunoglobulin.

11. A method according to any one of claims 1 to 9, wherein the cells are screened by
assessment of mutation rates by direct sequencing of the target sequences.

12. A method according to any one of claims 1 to 9, wherein the cells are screened by
an immunofluorescence technique.

13. A method according to any preceding claim, wherein the rate of mutation in the
cell is modulated by the administration of a mutagen or the expression of a
sequence-modifying gene product.

14. A method according to any preceding claim, wherein the rate of mutation in the
cell is modulated by genetic manipulation.

15. A method according to claim 14, wherein one or more genes involved in DNA
repair are manipulated.

16. A method according to claim 15, wherein said one or more genes are Rad51
analogues and/or paralogues.

17. A method according to claim 15, wherein the genes are selected from the group
consisting of Rad51b, Rad51c and analogues and/or paralogues thereof.

18. A method for preparing a gene product having a desired activity, comprising the
steps of:

a) expressing a nucleic acid encoding the gene product in a population of
cells according to claim 1, operably linked to a sequence which directs hypermutation;

b) identifying a cell or cells within the population of cells which expresses a
mutated gene product having the desired activity; and

c) establishing one or more clonal populations of cells from the cell or cells
identified in step (b), and selecting from said clonal populations a cell or cells which
expresses a gene product having an improved desired activity.

19. A method according to claim 18, wherein the cell or cells direct constitutive
hypermutation to an endogenous V gene locus.

20. A method according to claim 18 or claim 19, wherein the control sequences which
direct hypermutation are selected from sequences occurring downstream of a J gene
cluster.

21. A method according to claim 20, wherein the control sequences comprise
elements Ei/MAR, Cκ plus flanking regions and E3' as defined according to Klix et al.,
(1998) Eur. J. Immunol. 28:317-326.

22. A method according to any one of claims 18 to 21, wherein the nucleic acid region
operatively linked to control sequences which direct hypermutation is an exogenous
sequence inserted into the cell or cells.

23. A method according to claim 22, wherein the exogenous sequence comprises a
heterologous coding sequence operably linked to control sequences homologous to the
cell or cells which direct hypermutation.

24. A method according to claim 23, wherein an endogenous V region coding
sequence is replaced by a heterologous coding sequence.

25. A method according to any one of claims 18 to 24, wherein the gene product is an
immunoglobulin.



26. A method according to any one of claims 18 to 25, wherein the gene product is a
DNA binding protein.

27. A method according to any one of claims 18 to 26, wherein the desired activity is a
binding activity.

28. A method according to any one of claims 18 to 27, wherein the gene product is an
enzyme.

29. A method according to any one of claims 18 to 28, wherein steps b) and c) are
iteratively repeated.

30. The use of a cell capable of directed constitutive hypermutation of a specific
nucleic acid region in the preparation of a gene product having a desired activity.

31. A cell capable of directed constitutive hypermutation, wherein said cell is a
genetically manipulated chicken bursal lymphoma cell line.

32. A cell capable of directed constitutive hypermutation, wherein said cell is a
genetically manipulated chicken DT40 cell.

33. A cell according to claim 32, selected from the group consisting of Δxrcc2 DT40
and Δxrcc3 DT40.
Box No. VIII (iv) DECLARATION: INVENTORSHIP (only for the purposes of the designation of the United States of America)
The declaration must conform to the following standardized wording; see Notes to Boxes Nos. VIII, VIII (i) to (v)
(in general) and the specific Notes to Box No. VIII (iv). If this Box is not used, this sheet should not be included in the request.

Declaration of inventorship (Rules 4.17(iv) and 51bis.1(a)(iv))
for the purposes of the designation of the United States of America:

I hereby declare that I believe I am the original, first and sole (if only one inventor is listed below) or joint (if more than one inventor
is listed below) inventor of the subject matter which is claimed and for which a patent is sought.

This declaration is directed to the international application of which it forms a part (if filing declaration with application).

This declaration is directed to international application No. PCT/GB02/02688 (if furnishing declaration pursuant
to Rule 26ter).

I hereby declare that my residence, mailing address, and citizenship are as stated next to my name.

I hereby state that I have reviewed and understand the contents of the above-identified international application, including the claims
of said application. I have identified in the request of said application, in compliance with PCT Rule 4.10, any claim to foreign priority,
and I have identified below, under the heading "Prior Applications," by application number, country or Member of the World Trade
Organization, day, month and year of filing, any application for a patent or inventor's certificate filed in a country other than the United
States of America, including any PCT international application designating at least one country other than the United States of America,
having a filing date before that of the application on which foreign priority is claimed.

Prior Applications: US 09/879,813 filed 11th June 2001 & US 10/146,505 filed 15th May 2002



I hereby acknowledge the duty to disclose information that is known by me to be material to patentability as defined by
37 C.F.R. § 1.56, including for continuation-in-part applications, material information which became available between the filing date
of the prior application and the PCT international filing date of the continuation-in-part application.

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief
are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful
false statements may jeopardize the validity of the application or any patent issued thereon.

Name: SALE, Julian Edward

Residence: .worn , 

(city and either US state, if applicable, or country) 

Mailing Address: MRC Laboratory of Molecular Biology, Hills Road, Cambridge, CB2 2QH, United Kingdom



Citizenship: . 
Inventor's Signature: . 




(if not contained in the request, or if declaration is corrected or 
added under Rule 26ter after the filing of the international
application. The signature must be that of the inventor, not that of 
the agent) 



Date: [illegible]

(of signature which is not contained in the request, or of the 
declaration that is corrected or added under Rule 26ter after the
filing of the international application) 



Name: NEUBERGER, Michael Samuel

Residence: United Kingdom

(city and either US state, if applicable, or country)

Mailing Address: MRC Laboratory of Molecular Biology, Hills Road, Cambridge, CB2 2QH, United Kingdom



Citizenship: British

Inventor's Signature: . . H. 

(if not contained in the request, or if declaration is corrected or 
added under Rule 26ter after the filing of the international
application. The signature must be that of the inventor, not that of 
the agent) 



Date: [illegible]

(of signature which is not contained in the request, or of the
declaration that is corrected or added under Rule 26ter after the 
filing of the international application) 



This declaration is continued on the following sheet, "Continuation of Box No. VIII (iv)" 



Form PCT/RO/101 (declaration sheet (iv)) (March 2001; reprint July 2001) See Notes to the request form
Box No. VIII (iv) DECLARATION: INVENTORSHIP (only for the purposes of the designation of the United States of America)

The declaration must conform to the following standardized wording; see Notes to Boxes Nos. VIII, VIII (i) to (v)
(in general) and the specific Notes to Box No. VIII (iv). If this Box is not used, this sheet should not be included in the request.

Declaration of inventorship (Rules 4.17(iv) and 51bis.1(a)(iv))
for the purposes of the designation of the United States of America:

I hereby declare that I believe I am the original, first and sole (if only one inventor is listed below) or joint (if more than one inventor
is listed below) inventor of the subject matter which is claimed and for which a patent is sought.

This declaration is directed to the international application of which it forms a part (if filing declaration with application).

This declaration is directed to international application No. PCT/GB02/02688 (if furnishing declaration pursuant
to Rule 26ter).

I hereby declare that my residence, mailing address, and citizenship are as stated next to my name.

I hereby state that I have reviewed and understand the contents of the above-identified international application, including the claims
of said application. I have identified in the request of said application, in compliance with PCT Rule 4.10, any claim to foreign priority,
and I have identified below, under the heading "Prior Applications," by application number, country or Member of the World Trade
Organization, day, month and year of filing, any application for a patent or inventor's certificate filed in a country other than the United
States of America, including any PCT international application designating at least one country other than the United States of America,
having a filing date before that of the application on which foreign priority is claimed.

Prior Applications: US 09/879,813 filed 11th June 2001 & US 10/146,505 filed 16th May 2002



I hereby acknowledge the duty to disclose information that is known by me to be material to patentability as defined by 
37 C.F.R. § 1.56, including for continuation-in-part applications, material information which became available between the filing date
of the prior application and the PCT international filing date of the continuation-in-part application. 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief 
are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful
false statements may jeopardize the validity of the application or any patent issued thereon. 

Name: .9.M'^^^[^.^' ^?!^^ 

Residence: United Kingdom

(city and either US state, if applicable, or country) 

Mailing Address: MRC Laboratory of Molecular Biology, Hills Road, Cambridge, CB2 2QH, United Kingdom



Citizenship: British



Inventor's Signature: 
(if not contained in the request, or if declaration is corrected or
added under Rule 26ter after the filing of the international
application. The signature must be that of the inventor, not that of 
the agent) 



Date: [illegible]

(of signature which is not contained in the request, or of the 
declaration that is corrected or added under Rule 26ter after the
filing of the international application) 



Name: 

Residence: .* 

(city and either US state, if applicable, or country) 

Mailing Address: 



Citizenship: 

Inventor's Signature: 

(if not contained in the request, or if declaration is corrected or 
added under Rule 26ter after the filing of the international 
application. The signature must be that of the inventor, not that of
the agent) 



Date: 

(of signature which is not contained in the request, or of the 
declaration that is corrected or added under Rule 26ter after the 
filing of the international application)



This declaration is continued on the following sheet, "Continuation of Box No. VIII (iv)".
Form PCT/RO/101 (declaration sheet (iv)) (March 2001; reprint July 2001) See Notes to the request form
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Job : 494 

Date: 9/13/2006 
Time: 4:50:58 PM 



To David Marsh/Atty/DC/ArnoldAndPorter@APORTER

cc "KELLEY, THOMAS E [AG/2551]"
<thomas.e.kelley@monsanto.com>, Thomas
Holsten/Atty/DC/ArnoldAndPorter@APORTER

bcc

Subject RE: FW: GB1 final rejection: 38-15(52913)C (serial no.
10/839,092)

"UNSON, MIA D [AG/2551]"
<mia.d.unson@monsanto.com>

09/13/2006 03:27 PM



I'm happy with that. 
Mia 

Original Message 

From: David_Marsh@aporter.com [mailto:David_Marsh@aporter.com]
Sent: Wednesday, September 13, 2006 3:21 PM
To: UNSON, MIA D [AG/2551]
Cc: KELLEY, THOMAS E [AG/2551]; Thomas_Holsten@aporter.com
Subject: Re: FW: GB1 final rejection: 38-15(52913)C (serial no. 10/839,092)



In light of below I would adopt the last set of draft claims. David 



"UNSON, MIA D [AG/2551]"
<mia.d.unson@monsanto.com>

09/13/2006 03:07 PM

To <David_Marsh@aporter.com>, <Thomas_Holsten@aporter.com>

cc "KELLEY, THOMAS E [AG/2551]" <thomas.e.kelley@monsanto.com>

Subject FW: GB1 final rejection: 38-15(52913)C (serial no. 10/839,092)



I asked Paolo to check for cleavage sites and this is what he sent me.
Mia 



Original Message 

From: CASTIGLIONI, PAOLO [AG/2551]
Sent: Wednesday, September 13, 2006 3:01 PM
To: UNSON, MIA D [AG/2551]
Subject: RE: GB1 final rejection: 38-15(52913)C (serial no. 10/839,092)



Mia, 



I did check for cleavage signal peptide using the "Sigcleave" tool in
BITS (sigcleave predicts the site of cleavage between a signal sequence
and the mature exported protein. The predictive accuracy is estimated to
be around 75-80% for both prokaryotic and eukaryotic proteins)



The only putative hit was for: 
PLPLLLLEQFAPS at residues 69-81 



I did check for Chloroplast transit peptide using the "ChloroP" in BITS 
toolset (The ChloroP server predicts the presence of chloroplast transit 
peptides (cTP) in protein sequences and the location of potential cTP 
cleavage sites) 



No hit reported 



Just as a comment, I do not understand how the Liu peptide sequence was
predicted from the cDNA sequence (it starts with Q).



Let me know if you would like a more detailed or sophisticated analysis.



Paolo 
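
The position of the reported hit can be cross-checked against the two polypeptides quoted further down in this thread. The sketch below is a minimal Python illustration and not the Sigcleave algorithm: it only locates the reported 13-residue window in the Jumbo polypeptide and compares it with the point where cleavage would have to occur for the mature product to begin at the first residue of the GB1 protein. The sequences are transcribed from the listings in this thread (only the 16-residue Jumbo leader and the first 72 GB1 residues are used), and the names `LEADER`, `GB1_START`, `HIT` and `one_based_span` are introduced here purely for illustration.

```python
# Leader of Liu Jumbo SEQ ID NO. 52,139 that precedes the GB1-matching part,
# as transcribed from the listing in this thread.
LEADER = "QTYVGRSLAGFEGPRS"

# First 72 residues of the GB1-matching polypeptide (first listing line).
GB1_START = (
    "MIPYATAAEAEGALGRTMTWAETAWYEYSAVMPDSWLHCHTTFILFVIYSIAPLPLLLLEQFAPSVVLPYKL"
)

# Window reported as the only putative Sigcleave hit.
HIT = "PLPLLLLEQFAPS"


def one_based_span(sequence, motif):
    """Return the 1-based (first, last) residue positions of motif, or None."""
    start = sequence.find(motif)
    if start < 0:
        return None
    return (start + 1, start + len(motif))


jumbo_start = LEADER + GB1_START

# Where the reported window sits in the Jumbo polypeptide.
hit_in_jumbo = one_based_span(jumbo_start, HIT)

# Where the same window sits when numbering starts at the GB1 methionine.
hit_in_gb1 = one_based_span(GB1_START, HIT)

# For the mature product to equal GB1, cleavage would need to follow this residue.
required_cut_after = len(LEADER)

print("hit in Jumbo numbering:", hit_in_jumbo)
print("hit in GB1 numbering:", hit_in_gb1)
print("cleavage needed after residue:", required_cut_after)
```

Under these assumptions the window falls at residues 69-81 of the Jumbo polypeptide, which agrees with the reported position, whereas a cleavage that leaves exactly the GB1 protein would have to follow residue 16, the end of the leader.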



Original Message 

From: UNSON, MIA D [AG/2551]
Sent: Wednesday, September 13, 2006 2:32 PM
To: CASTIGLIONI, PAOLO [AG/2551]
Subject: FW: GB1 final rejection: 38-15(52913)C (serial no. 10/839,092)
Importance: High



Paolo, 



Here are the two sets of sequences 



GB1 case:

GB1 case 52913C SEQ ID NO 1:

MIPYATAAEAEGALGRTMTWAETAWYEYSAVMPDSWLHCHTTFILFVIYSIAPLPLLLLEQFAPSVVLPYKL
QPRVRLPPAASLSCYMDAACIFPLAVGLQFVSYPAVAKILRTRMGLPLPSVRETIAQLVVYSLVEDYLSYWM
HRLLHTQWCYEKIHRVHHEFTAPTGFAMSYSHWAENVVLSIPALAGPVLVPCHVTTQWLWFSIRLIEGINTH
SGYHFPFSPCRLIPFYGGAAYHDYHHYAGGRSQSNFAPLFTYCTYLYRTDKGYRYHKLKQEKLKSLAENSAD
KGGNYSFDEGKKNRYFCA



GB1 case 52913C SEQ ID NO 19

ATGATCCCCTACGCGACTGCGGCGGAGGCGGAGGGAGCACTGGGGCGCACCATGACGTGGGCTGAGACAGCA 

TGGTACGAGTACTCGGCGGTGATGCCAGATTCCTGGCTGCACTGCCACACCACATTTATCCTGTTCGTCAT^ 

TACAGCATCGCCCCGCTGCCCCTGCTACTCCTAGAGCAGTTCGCTCCGTCCGTCGTGCTGCCGTACAAGCTG 

CAGCCCCGGGTACGGCTGCCCCCGGCAGCCTCCCTCAGCTGCTACATGGACGCGGCCTGCATCTTTCCGCTC 

GCCGTTGGCCTTCAGTTCGTCTCCTATCCTGCGGTCGCCAAGATACTAAGGACCCGAATGGGACTGCCGTTG 

CCGTCGGTGAGGGAGACCATCGCGCAGCTAGTCGTATACTCTCTAGTGGAGGATTACCTCAGCTACTGGATG 

CACCGTCTGCTGCACACCCAGTGGTGCTACGAGAAGATCCACCGCGTCCACCACGAGTTCACGGCTCCTACA 

GGCTTCGCCATGTCGTACAGCCACTGGGCCGAGAACGTCGTCCTTTCTATCCCGGCCTTGGCCGGCCCAGTG 

CTCGTGCCATGCCATGTCACCACGCAGTGGCTATGGTTCTCCATCCGCCTAATTGAGGGCATTAACACGCAC 

AGCGGTTACCATTTCCCGTTCAGCCCTTGCAGGCTGATTCCATTCTACGGAGGGGCTGCATACCATGACTAC 

CATCACTATGCAGGAGGCCGTAGCCAAAGCAACTTTGCACCCCTGTTCACCTACTGTG^ 

ACAGACAAAGGCTACAGATACCACAAGCTAAAGCAAGAGAAGCTGAAGAGTCTAGCAGAAAATAGTGCGGAT 

AAAGGAGGCAACTACTCATTCGACGAAGGGAAAAAGAACAGATATTTTTGTGCCTGA 



Jumbo case -- underlined sequence corresponds to the GB1 case
sequences. Question is -- would someone of ordinary skill in the
art predict a cleavage site in the Jumbo polypeptide sequence
(SEQ ID NO. 52,139) so that the mature polypeptide corresponds
exactly to the GB1 protein (SEQ ID NO. 1 above)?



Liu Jumbo 53313B SEQ ID NO. 52,139: 
QTYVGRSLAGFEGPRS 

MIPYATAAEAEGALGRTMTWAETAWYEYSAVMPDSWLHCHTTFILFVIYSIAPLPLLLLEQFAPSVVLPYKL 
QPRVRLPPAASLSCYMDAACIFPLAVGLQFVSYPAVAKILRTRMGLPLPSVRETIAQLVVYSLVEDYLSYWM
HRLLHTQWCYEKIHRVHHEFTAPTGFAMSYSHWAENVVLSIPALAGPVLVPCHVTTQWLWFSIRLIEGINTH 
SGYHFPFSPCRLIPFYGGAAYHDYHHYAGGRSQSNFAPLFTYCDYLYRTDKGYRYHKLKQEKLKSLAENSAD 
KGGNYSFDEGKKNRYFCA 



Liu Jumbo 53313B SEQ ID NO 25,824

CGAACAGTTGAAGCTACTAGCGTGTAGCTA6GGAAGAGAAGCGCGCGAGTAGCTAGCAGACGTACGTAGGCA 
GAAGCCTAGCTGGGTTTGAAGGGCCCCGATCG 

ATG 

ATCCCCTACGCGACTGCGGCGGAGGCGGAGGGAGCACTGGGGCGCACCATGACGTGGGCTGAGACAGCATGG 
TACGAGTACTCGGCGGTGATGCCAGATTCCTGGCTGCACTGCCACACCACATTTATCCTGTTCGTCATCTAC 
AGCATCGCCCCGCTGCCCCTGCTACTCCTAGAGCAGTTCGCTCCGTCCGTCGTGCTGCCGTACAAGCTGCAG 
CCCCGGGTACGGCTGCCCCCGGCAGCCTCCCTCAGCTGCTACATGGACGCGGCCTGCATCTTTCCGCTCGCC 
GTTGGCCTTCAGTTCGTCTCCTATCCTGCGGTCGCCAAGATACTAAGGACCCGAATGGGACTGCCGTTGCCG 
TCGGTGAGGGAGACCATCGCGCAGCTAGTCGTATACTCTCTAGTGGAGGATTACCTCAGCTACTGGATGCAC 
CGTCTGCTGCT^CACCCAGTGGTGCTACGAGAAGATCCACCGCGTCCACCACGAGTTCACGGCTCCTACA 
TTCGCCATGTCGTACAGCCACTGGGCCGAGAACGTCGTCCTTTCTATCCCGGCCTTGGCCGGCCCAGTGCTC 
GTGCCATGCCATGTCACCACGCAGTGGCTATGGTTCTCCATCCGCCTAATTGAGGGCATTAACACGCACAGC 
GGTTACCATTTCCCGTTCAGCCCTTGCAGGCTGATTCCATTCTACGGAGGGGCTG(^TACCATGACTACCAT 
CACTATGCAGGAGGCCGTAGCCAAAGCAACTTTGCACCCCTGTTCACCTACTGTGATTATTTATATAGGACA 
GACAAAGGCTACAGATACCACT^GCTAAAGCAAGAGAAGCTGAAGAGTCTAGC^ 
GGAGGCAACTACTCATTCGACGAAGGGAAAAAGAACAGATATTTTTGTGCC 
TGA 

GCGTACGAAGAATAATCAAGGCTATTACTTCGTCCTGTTCGAAGGGAAGATTTGCAAATAAATAATTCGAAT 
TTACTGCAGAAGCTCTTGATTGGTCGACGAACAAATATATTGTTGCAATCCGriTCGTGTATO 



TAATATGAAACTTTTTTGTCGGAATATATGGATCATCCCGATACATTACITTATAAGTAATG^ 
AAGTTTTTGTTGG 



Thanks, 
Mia 
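
The relationship between the Jumbo nucleotide listing and the Jumbo polypeptide can be illustrated by translating the stretch immediately upstream of the ATG in the same reading frame. The following is a minimal Python sketch under stated assumptions: it uses the standard genetic code, the 48 nucleotides directly preceding the ATG as transcribed from SEQ ID NO 25,824 above (the listing further upstream is damaged in this copy and is not used), and the first 18 nucleotides of the coding region. The names `UPSTREAM_48`, `ORF_START` and `translate` are hypothetical helpers introduced for this example.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}

# 48 nucleotides immediately upstream of the ATG (transcribed from the listing).
UPSTREAM_48 = "CAGACGTACGTAGGCAGAAGCCTAGCTGGGTTTGAAGGGCCCCGATCG"

# First six codons of the coding region shared with the GB1 case.
ORF_START = "ATGATCCCCTACGCGACT"


def translate(dna):
    """Translate dna codon by codon in frame 1, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


leader_peptide = translate(UPSTREAM_48)
start_of_gb1 = translate(ORF_START)

print(leader_peptide)
print(start_of_gb1)
```

With these inputs the upstream stretch translates to the 16 residues that precede the methionine in the Jumbo polypeptide, which is consistent with that polypeptide beginning with Q rather than with a start codon.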



This e-mail message may contain privileged and/or confidential 
information, and is intended to be received only by persons entitled to 
receive such information. If you have received this e-mail in error, 
please notify the sender immediately. Please delete it and all 
attachments from any servers, hard drives or any other media. Other use 
of this e-mail by you is strictly prohibited. 



All e-mails and attachments sent and received are subject to monitoring, 
reading and archival by Monsanto. The recipient of this e-mail is solely 
responsible for checking for the presence of "Viruses" or other 
"Malware". Monsanto accepts no liability for any damage caused by any 
such code transmitted by or accompanying this e-mail or any attachment. 



This communication may contain information that is legally privileged, 
confidential or exempt from disclosure. If you are not the intended 
recipient, please note that any dissemination, distribution, or copying 
of this communication is strictly prohibited. Anyone who receives this 
message in error should notify the sender immediately by telephone or by 
return e-mail and delete it from his or her computer. 



David Marsh David_Marsh@aporter.com 

Arnold & Porter LLP Telephone: 202-942-5068 

555 Twelfth Street, NW Fax: 202-942-5999 

Washington, DC 20004-1206 

For more information about Arnold & Porter LLP, click here: 
http://www.arnoldporter.com



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Job : 506 
Date: 9/13/2006 
Time: 4:53:06 PM 



Coolidge v. Efendic
Schedule For Preliminary Motion Phase
(Proposed Amended Times)

Action                                  Time                            Date

Declaration (according to                                               16 May 06
accompanying default schedule)

Real Party In Interest                  14 days from Declaration        Tue 30 May 06

Request for file copies                 14 days from Declaration        Tue 30 May 06

Clean Copy of Claims                    14 days from Declaration        Tue 30 May 06

Lead/Backup counsel                     14 days from Declaration        Tue 30 May 06

Related proceedings                     [illegible] Declaration         Tue 30 May 06

Drawing analysis                        [illegible] Declaration         Tue 13 Jun 06

Notify Board of incomplete files        [illegible] days from order     Tue 20 Jun 06
                                        [illegible]

Proposed motions list                   1 business day prior to         Sun 09 Jul 06
                                        conference call

Conference call to set dates                                            Tue 11 Jul 06

Notice of confidential information      2 months from Declaration       Sun 16 Jul 06

Last day to initiate settlement         3 months from declaration -     Wed 16 Aug 06
conference                              but see SO 126.4, requiring
                                        joint statement before
                                        conference call

TIME PERIOD 1 - 29 Aug 06

Preliminary Motions due                                                 Tue 29 Aug 06


File priority statements                                                Tue 29 Aug 06

Serve objections to preliminary                                         05 Sep 06
motion evidence

Serve priority statements                                               Tue [illegible] Sep 06

Serve corrected evidence                                                19 Sep 06

TIME PERIOD 2 - 19 Sep 06

File motions responsive to                                              Tue 19 Sep 06
preliminary motions

Serve objections to preliminary                                         Tue 26 Sep 06
motion evidence

Serve corrected evidence                                                Tue 10 Oct 06

Cross-examination opens                                                 Tue 10 Oct 06

Cross-examination closes                                                Sat 21 Oct 06

TIME PERIOD 3 - 31 Oct 06

File opposition to preliminary                                          Tue 31 Oct 06
motions

Serve objections to opposition                                          Tue 04 Oct 06
evidence

Serve corrected opposition evidence                                     Tue 21 Nov 06

Cross-examination opens                                                 Tue 21 Nov 06

Cross-examination closes                                                Thu 07 Dec 06


TIME PERIOD 4 - 18 Dec 06

File replies to oppositions to                                          Mon 18 Dec 06
preliminary motions

Serve objections to reply evidence                                      Tue 26 Dec 06

Serve corrected reply evidence                                          [illegible] Jan 07

Cross-examination opens                                                 09 Jan 07

Cross-examination closes                                                Fri 19 Jan 07

TIME PERIOD 5 - 29 Jan 07

File request for oral argument                                          Mon 29 Jan 07

File motions to exclude evidence                                        Mon 29 Jan 07

File observations on                                                    Mon 29 Jan 07
cross-examination of reply declarants

TIME PERIOD 6 - 12 Feb 07

File oppositions to motions to                                          Mon 12 Feb 07
exclude evidence

File responses to observations on                                       Mon 12 Feb 07
cross-examination of reply declarants

TIME PERIOD 7 - 20 Feb 07

File replies to oppositions to                                          Tue 20 Feb 07
motions to exclude evidence

TIME PERIOD 8 - 27 Feb 07


File exhibits                                                           Tue 27 Feb 07

File sets of motions                                                    Tue 27 Feb 07

TIME PERIOD 9 - 27 Mar 07

Present oral argument                                                   Tue 27 Mar 07

TIME PERIOD 10 - 01 May 07 (Estimated)

Panel decision on preliminary                                           Estimated 01 May 07
motions

2nd settlement conference               [illegible] months after panel  Estimated 01 Jul 07
                                        decision on motions; [illegible]
                                        statement indicating good faith
                                        effort to settle before this
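
The legible intervals in the schedule ("14 days from Declaration", "2 months from Declaration", "3 months from declaration") can be checked against the listed dates with simple calendar arithmetic. The sketch below is an illustration only, assuming the declaration date of 16 May 2006 shown in the first row; the helper names `due_after_days`, `add_months` and `label` are introduced here and are not part of the schedule.

```python
import calendar
from datetime import date, timedelta

# Declaration date taken from the first row of the schedule.
DECLARATION = date(2006, 5, 16)

WEEKDAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]
MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def due_after_days(start, days):
    """Date falling a fixed number of calendar days after start."""
    return start + timedelta(days=days)


def add_months(start, months):
    """Same day of the month a number of months later, clamped to month end."""
    month_index = start.month - 1 + months
    year = start.year + month_index // 12
    month = month_index % 12 + 1
    day = min(start.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def label(day):
    """Format a date the way the schedule prints it, e.g. 'Tue 30 May 06'."""
    return (f"{WEEKDAYS[day.weekday()]} {day.day:02d} "
            f"{MONTHS[day.month - 1]} {day.year % 100:02d}")


print(label(due_after_days(DECLARATION, 14)))  # 14 days from Declaration
print(label(add_months(DECLARATION, 2)))       # 2 months from Declaration
print(label(add_months(DECLARATION, 3)))       # 3 months from declaration
```

These three computed dates agree with the entries for 30 May 06, 16 Jul 06 and 16 Aug 06. The 13 Jun 06 date listed for the drawing analysis, whose interval is not legible in this copy, falls 28 days after the declaration.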